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When growing a crystal, a planar front is desired for most of the applications. This plane shape 
is often destroyed by instabilities of various types. In the case of growth from a condensed phase, 
the most frequent instabilities are diffusion instabilities, which have been studied in detail by many 
authors and will be but briefly discussed in simple terms in chapter O. The present review is mainly 
devoted to instabilities which arise in ballistic growth, especially Molecular Beam Epitaxy (MBE). 
The reasons of the instabilities can be geometric, but they are mostly kinetic (when the desired state 
cannot be reached because of a lack of time) or thermodynamic (when the desired state is unstable). 
The kinetic instabilities which will be studied in detail in chapters IV and M result from the fact that 



adatoms diffusing on a surface do not easily cross steps (Ehrlich-Schwoebel or ES effect). When the 
growth front is a high symmetry surface, the ES effect produces mounds which often coarsen in time 
according to power laws. When the growth front is a stepped surface, the ES effect initially produces 
a meandering of the steps, which eventually may also give rise to mounds. Kinetic instabilities can 
usually be avoided by raising the temperature, but this favours thermodynamic instabilities of the 
thermodynamically unstable materials (quantum wells, multilayers ...) which are usually prepared 
by MBE or similar techniques. The attention will be focussed on thermodynamic instabilities which 
result from slightly different lattice constants a and a + Sa of the substrate and the adsorbate. They 



can take the following forms, i) Formati on of misfit dislocations, whose geometry, mechanics and 

ii) Formation of isolated epitaxial clusters which, at 
i the substrate, i.e. dislocation-free (chapter KT). iii) 



VIII 



kinetics are analyzed in detail in chapter 

least in their earliest form, are 'coherent' wit 

Wavy deformation of the surface, which is presumably the incipient stage of (ii) (chapter [IX| J. The 

theories and the experiments are critically reviewed and their comparison is qualitatively satisfactory 

although some important questions have not yet received a complete answer. Short chapters are 

devoted to shadowing instabilities, twinning and stacking faults, as well as the effect of surfactants. 
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I. GROWTH, SURFACE ROUGHNESS AND INSTABILITIES 
A. Growth instabilities 

Crystals are known to be the stable form at low temperature of all materials, except helium under moderate 
pressure. 

Nevertheless, most of the natural materials which surround us are not crystals or are made of very small crystals. 
This fact shows that crystals do not easily grow. The reason is that growing crystals are subject to instabilities which 
introduce crystal defects and eventually result in the formation of polycrystals or even amorphous materials. 

On the other hand, crystals are of great technological interest, because they have reproducible properties. Semi- 
conductor technology uses large quantities of big crystals of silicon which are artificially pulled from the liquid phase 
and which are perfect. This has only been possible because the mechanism of instabilities in this type of growth 
was well understood. Even so, in the case of metals, it is impossible to avoid the formation of a few dislocations in 
a macroscopic crystal. The case of semiconducting elements (e.g. Si), which can be prepared as ideal crystals, is 
exceptional. These elements, even though they can be modified by implantation of appropriately chosen impurities, 
are not sufficient for all technological requirements. Quantum wells, for instance, are made of a slice of a material A, 
of well-defined thickness, sandwiched between two slices of a material S. A usual case is ^4=GaAs and S'=GaAlAs. 
One of the standard techniques to do that is Molecular Beam Epitaxy (MBE) O. 

The word epitaxy designates the adsorption of a crystal on another crystal, with a well-defined relative orientation 
of the two crystals. In most of the examples cited in the present review, both crystals are cubic and their principal 
axes are parallel. 

In MBE, the atoms are ballistically deposited by a beam of molecules or atoms. The required thickness is obtained 
by switching the beam of material S during an appropriate time. If the atoms do not stick at once, they are sucked 
out by a vacuum pump, so that one can for instance deposit n layers of AlAs, then m layers of GaAs and avoid the 
formation of a GaAlAs mixture. 

MBE is a very slow technique, which has the advantage to permit the production of artificial devices which could 
not be produced by other methods. Other examples are multilayers (Fig. fll) or 'superlattices' which are appropriate 
for fundamental research because they are periodic and therefore have, for instance, a simple diffraction spectrum. 
These materials are generally not thermodynamically stable but can be metastable during a very long time once they 
have been prepared. 

Growth by ballistic deposition, as well as from the melt or a solution, is subject to instabilities. These instabilities 
generally appear at the place where the dynamics is fastest, i.e. the surface. The initial stage of the instability is a 
deformation of the growing surface (or 'front') which ceases to be planar and starts forming bumps or holes. Growth 
instabilities may be classified as follows (Fig. @): 

i) Diffusion instabilities (Fig. ga) are typical of growth from the melt or from a solution. In a supersaturated 
solution, dendrites can form because diffusing atoms or molecules tend to go to the nearest point of the solid, and 
therefore to tips which unavoidably form at the growth front. 

ii) Kinetic instabilities (Fig. 0b) which take place because the growth is too fast and the surface has no time to 
find its equilibrium shape, which in the simplest case is a chemically homogeneous plane. 

iii) Thermodynamic instabilities (Fig. 0c) which take place when one tries to make a thermodynamically unstable 
material. 

iv) Geometric instabilities. A typical example is the shadowing instability which occurs in oblique deposition (Fig. 

Diffusion instabilities have been studied in many articles and review papers. For this reason, only a few words will 
be said about them in chapter O, and the present review will mainly be devoted to the other types of instabilities. 
One might think that diffusion instabilities do not arise in ballistic deposition and in particular in MBE. However, 
diffusion docs occur on the surface and is responsible for instabilities as will be seen in chapter M. 

A special type of kinetic instability will be investigated in great detail in chapters IV and M. It occurs when atoms 



diffusing on the surface have difficulties to cross steps. Presumably, the reason of its popularity among theorists is 
that this instability is consistent with the so-called S.O.S. model, a model in which the topology of the solid is fixed, 
atoms are assumed to pile up on preexisting lattice sites. Other instabilities, e.g. stacking faults, are not so easy to 
describe by a simple model. 

Thermodynamic instabilities often result from a difference in the lattice constants of different materials which one 
tries to assemble in a single crystal. If the lattice constants of the substrate and the adsorbate are a and a + da, then 
da/a is called (relative) 'misfit' or 'mismatch'. It will play an important role in chapters VIII , [X and [X| where this 



instability will be studied. Another cause of thermodynamic instability may be of chemical nature. A typical example 
is the solidification of an eutectic solution, which by definition leads to coexisting clusters of two chemically different 



alloys. The cluster size is determined by atomic motion on and near the growth front, and is related to the shape of 
a front, which becomes more or less complex and typical of a growth instability |2J]. 

Strictly speaking, thermodynamic instabilities do not appear only during growth. However, in practice, they mainly 
appear at high enough temperature, and in particular, during growth, while the evolution of a solid toward equilibrium 
at room temperature is often geologically slow. 



The shadowing instability will be much more briefly addressed in chapter VI , as well as kinetic instabilities resulting 



from stacking faults and polymorphism, which will be treated in chapter XII . The very unequal length of the various 
chapters corresponds partly to the unequal competence of the authors, partly to a different state of the art. In cases 
which are still poorly understood, we shall mainly give references, essentially experimental ones. 

In the case of non-crystalline materials, the bumps and holes which are formed at the surface are macroscopic. 
In the case of crystals, which is of interest in this review, the modulation of the surface may be microscopic. It is 
generally the case, for instance, when dislocations are forme d. Di slocation formation, which may be in competition 



with other types of instabilities, will be addressed in chapter VIII . 

Instabilities are often characterized by a sudden burst. Before the instability takes place, the surface is smooth. 
Then, suddenly, an instability develops and its amplitude increases dramatically for short times, for instance expo- 
nentially with time. This fast initial increase gives no information on the final state of the system, which may be 
a spatially periodic modulation, or a general collapse, or chaotic. Only in the case of a sharp initial increase of the 
perturbation the word 'instability' will be used. It will not be used for instance if the fluctuation amplitude 5h(t) of 
the surface increases as a power of the time t, 5h(t) ~ t x l z . In the latter case, we will just say that the surface is 
'rough'. 

B. Thermal roughness and roughening transition 

A rough surface, with a roughness which increases in time, is frequently predicted for a growing object 13-0] when 
the growth rate undergoes stochastic fluctuations, as is usually the case. But even at thermal equilibrium, i.e. in the 
absence of growth, the surface of a crystal can be rough. Although the minimization of the surface (or 'capillary') 
energy corresponds to a plane surface, thermal fluctuations can produce some 'roughness'. At low temperature, this 
roughness has only a microscopic scale, and the surface is said to be 'smooth'. The roughness becomes macroscopic, 
and the surface is then said to be 'rough', above the so-called roughening temperature Tr which depends very much 
on the surface orientation |3|-|5|. High symmetry surfaces, (001) or (111), of fee metals are smooth until very near the 
melting temperature, while high index surfaces, e.g. (1,1,11), have very low roughening temperatures which cannot 
be observed because thermodynamic equilibrium cannot be reached |4|. But thermal roughening can be observed on 
(113) or (110) metal surfaces. High symmetry surfaces of semiconductors are also believed to be smooth at equilibrium 
until very high temperatures, higher than those used for MBE. 

Elasticity is generally ignored in theoretical treatments of the roughening transitions which have been reviewed for 
instance by Nozieres Q , Pimpinelli & Villain Q , Barabasi & Stanley || . Some properties of crystal surfaces above 
and below T R are summarized in table S. An important property, which will be of interest in chapter IX , is that the 



energy of a plane surface, which above Tr is an analytic function of its orientation, becomes non analytic below Tr. 
In the case of a high symmetry surface, this is a consequence of the positive step free energy per unit length 7: if the 
surface is disoriented by an angle 8, the energy cost is the product of 7 by the number of steps, which is proportional 
to \6\, which is a non analytic function for 9 = 0. Such a surface with a non-analytic dependence on the orientation 



is called singular. Similarly, a height modulation of wavelength 2ir/q (see chapter IX) is an analytic function of q for 
T > Tr and non- analytic for T < Tr. 

Another property of a smooth, infinite, high symmetry crystal surface at equilibrium is that its height cannot change 
continuously. Indeed a continuous change requires the creation of an infinitely long step which has an infinite energy 
On the contrary, the height of a rough surface can change continuously without paying any free energy through a 
motion of the steps. A precise, renormalisation group description has been given by Tang & Nattermann |h]. 

The experimental determination of Tr is not always easy B. The safest method uses the equilibrium shape of 
crystals. The presence of facets of a particular orientation at a temperature T indicates that Tr > T for this 
orientation. Unfortunately, this method is only applicable if Tr is pretty close to the melting temperature Tm- For 
instance, for Si, where Tm — 1413°C Heyraud et al. J7]] have very recently found Tr = 1370°C for the {110} face, 
T R = 1340°C for the {113} face, and T R > 1400°C for the {111} face and add: "The case of {001} is much less clear, 
and is left to a future paper ...". This sentence which reflects the hope of a better future also reflects our present 
ignorance of thermodynamic properties of the most common surfaces. 

To conclude this paragraph on roughness, one can say that a 'rough' surface, as the man in the street would call 
it, can originate i) either from stochastic fluctuations or ii) from an instability. In the latter case, the mathematical 



description is provided by deterministic equations of motion. In the first case, the stochastic fluctuations can be either 
thermal fluctuations at equilibrium, or for instance the fluctuations of the beam in MBE (the so-called 'shot noise'). 
The mathematical description of beam fluctuations requires the introduction of a stochastic term in the equations, 
while thermal fluctuations are just taken into account by the Boltzmann factor in Gibbs's canonical distribution. We 
prefer to restrict the use of the word 'roughness' to that which arises from stochastic effects, and to use the expression 
'instability' when the roughness arises from a deterministic mechanism. Some references to stochastic effects will be 
made in the next chapters, but most of the present review will be devoted to instabilities. 

II. SNOWFLAKES, DIFFUSION INSTABILITIES AND DLA 

The goal of this chapter is to give an intuitive introduction to diffusion instabilities. The mathematical treatment 
will be found in review papers and books p-|l3||. 

A crystal growing from a dense phase often forms 'dendrites'. Snowflakes (Fig. pp) are the most familiar example. 
However, their formation in clouds is a complicated process [g[, since clouds are made of supercooled liquid droplets 
coexisting with solid particles and water molecules which travel from the former to the latter. It is simpler to consider 
growth from a homogeneous phase, e.g. a solution. As an example, Fig. Ep shows a pattern which has a resemblance 
to a snowflakc. It is not a crystal, but similar instabilities arise in crystals growth as well. Diffusion instabilities arise 
(Fig. 0a) because, for some reason (e.g. thermal fluctuations) small protruding parts appear. If there are protruding 
parts, the molecules which diffuse through the solution to the solid surface go preferably to those parts (Fig. |^a) 
because paths leading to protruding parts are on the average shorter. Thus, protruding parts become more protruding 
and form dendrites. 

This mechanism can be implemented by simulations. In the simplest model ('Diffusion limited aggregation' or 
DLA [ [L4|JL5| ) molecules are assumed to be independent in the fluid phase. They diffuse randomly from a very large 
distance, and stop moving as soon as they meet the solid. This model gives rise to very irregular shapes (Fig. 0). 

The striking difference between the irregular cluster of Fig. [| and the fairly regular shape of snowflakes is partly 
due to the hexagonal symmetry of ice crystals. Another factor which favours order is the possibility of atom diffusion 
along the solid surface, which is ignored in DLA. It is important in the case of snow, and according to Mason |lj] 
surface diffusion is the reason of certain shape transitions of snow with temperature. Finally, interactions between 
atoms, which are neglected in the DLA model, make growth more deterministic. An extreme case is growth from the 
melt. Since the fluid is almost incompressible, the above picture, based on matter diffusion, is not valid. Actually, 
the quantity which diffuses, at least in a pure melt, is not matter, but energy. The equation which represents energy 
diffusion in the liquid, or matter diffusion in a vapour (or in a solution) is the diffusion equation 

dp/dt = DV 2 p (1) 

where p(f, t) is the density of energy in one case, of matter in the other case. A third case of interest is solidification 
from an impure melt. Then the relevant diffusing quantity is the density of impurities, whose diffusion is much slower 
then energy diffusion. The relevant process is the slowest one. 

Equation (nl) must be coupled with appropriate boundary conditions at the interface, which is assumed to be a 
continuous surface ]ll] — [l^ Jl7| , [l8t ] . If one wishes to model 'directional solidification' (the usual way to pull a crystal 
from the melt, e.g. silicon) the initial condition is a plane surface. If the pulling velocity is low enough, the surface 
remains a plane. For a given temperature gradient, the instability sets in [ [L7[ if the pulling velocity v is larger than a 
critical value v c . It is called the Mullins-Sekerka instability. If v is not too large, the ultimate state of the surface is 
a periodic modulation jnTO|l7| (Fig. 0c) . The contrast between this smooth pattern and the DLA pattern of Fig. 
[| is striking. 

The DLA cluster is indeed a fractal [19] . This means that the relation between its radius R and its mass M is not 
M = Const x R d , where d = 3 is the dimension of the space, but (for large R) 

M = Const x R d J (2) 

where dj is called an 'effective fractal dimension' pQ] , Usually, it is not an integer and is smaller than the space 
dimension d, which is generally d — 3 in real materials although simulations are often made in d = 2 since they are 
easier. In two-dimensional DLA EG], df is close to 1.7. The radius R can be defined at any time as the distance from 
the center such that a diffusing atom has a probability lower than a certain value (e.g. 0.1) to reach it. 

One can wonder whether continuous theories based on (TH) can also produce a fractal. To our knowledge, there 
is no clear theoretical answer, but Arneodo et al. |21| performed experiments which can reasonably be described by 
continuous models and were able to produce fractal objects quite analogous to DLA clusters pl[|. 



The various patterns which arise from diffusive instabilities result from differences in the details. For instance, in 
directional solidification there is a temperature gradient which has an important stabilizing effect and has no equivalent 
in the experiments of Arneodo et al. pl| which, as reported above, yield fractal objects. However, the onset of the 
instability results from equations which, for a given initial condition (e.g. a plane front z — vt) are very similar in 
all the cases. At the beginning of the instability, the deviation dz(x,y,t) from the planar shape is indeed very small 
and the equations can be linearized with respect to Sz(x,y,t). A Fourier transformation in the (x, y) space yields 
dzk(t)/dt — uJkZk, where the coefficients ujk depend on the model, and there is an instability if at least one of them is 
positive. This method is called a 'linear stability analysis'. It is sufficient to prove that a plane front is unstable, but 
does not predict the shape of the resulting dendrites which can only be deduced from a nonlinear treatment. 

Before leaving diffusion instabilities, we come back to snow. The formation of snow crystals in clouds proceeds in 
three steps || : i) Heterogeneous nucleation of an ice particle by deposition from the vapour phase on an impurity 
called 'ice nucleus' or by nucleation in a supercooled water droplet by an ice nucleus, ii) Growth of the ice particle 
by deposition of water molecules evaporated from liquid droplets and diffusing through the vapour phase, until the 
diameter reaches about 0.2 mm. iii) Growth by sticking of supercooled liquid water droplets on the ice particle or by 
aggregation of ice particles. Examples of ice nuclei are small crystals of Agl. The typical diameter of a water droplet 
in a cloud is between 10 and 100 /zm. Although diffusion plays a part in these processes, the simple scheme of a 
diffusion instability is not applicable. 

In view of the considerable amount of work devoted to diffusion instabilities, nothing more will be said about this 
topic in the present review. Instead, we shall describe instabilities which arise when the atoms do not diffuse toward 
the crystal, but travel 'ballistically', i.e. without collisions, from a source, and then form epitaxial layers. 

We have nevertheless devoted this short chapter to diffusion instabilities for several reasons. First, for the com- 
pleteness of this review on growth instabilities. Second, instabilities which arise from diffusion on the moving front 
will be encountered in chapter M. Third, the methods which are useful to investigate diffusion instabiliti es ca n often 
be used also for other instabilities. For instance, linear stability analysis will be used in parts [TV], |\] and IX E| . 



III. GROWTH FROM DILUTED VAPOUR AND MBE 
A. MBE, what is that? 

The remainder of this review is devoted to techniques where the deposition is ballistic rather than diffusive, so 



that no instability can arise from diffusion in the fluid phase. Apart from the short chapter VI, the attention will be 
concentrated on Molecular Beam Epitaxy (MBE) and similar techniques. 

Among all the advanced semiconductor growth techniques, Molecular Beam Epitaxy provides the greatest ease of 
growing complex semiconductor multilayer structures with a precise control of the thickness (up to the monolayer 
scale accuracy), the composition and the doping of the involved layers E2]. In this technique, growth proceeds under 
ultrahigh vacuum conditions (pressures < 10~ 8 Pascals) by the condensation of thermal energy molecular beams on 
an underlying single crystal substrate. The substrate has a strong influence on the growth. Generally, the deposited 
film adopts as far as possible the orientation and crystallographic characteristics of the substrate. As a matter of fact, 
when thermal atoms or molecules arrive on a substrate, they diffuse until they reach an adequate crystallographic 
site where they can be incorporated (see Fig. ||). One can distinguish two main cases, i) On singular surfaces, atoms 
or molecules can meet each other and aggregate to form two-dimensional nuclei. Then, these two-dimensional nuclei 
grow by incorporating further atoms or molecules and finally coalesce to form new epilayers. This growth mode called 
'two-dimensional nucleation growth mode' can be monitored in-situ on an atomic scale by Reflection High Energy 
Electron Diffraction (RHEED) intensity oscillations. A RHEED intensity maximum corresponds to a single atomic 
or molecular layer completion because it is the surface roughness minimum, ii) On vicinal surfaces (see definition in 
chapter |V|), this growth mode can only exist if the terraces are large enough with respect to the diffusion length Id 
(see App. |B|). Otherwise, atoms and molecules are incorporated at existing steps, whose forward motion results in the 
growth of the crystal. This second growth mode is called 'step flow'. In both cases, most of the incorporation sites 
are supplied by steps which result either from two-dimensional nuclei edges or from vicinal surface steps and their 
number and distribution are important features of the growth front. 

MBE growth can be influenced by many parameters, e.g. the incident beam rates which determine the relative 
concentration of adatoms at the growth front, the adatom re-evaporation rates, the substrate temperature which 
determines the adatom migration velocity but also the surface chemical and/or structural properties such as roughness, 
composition or surface reconstruction. 



B. MBE of metals 

The case of metals is conceptually the simplest. The naive representation of atoms as hard spheres is not too bad, 
and the surface is in many cases not reconstructed. 

Thin metallic layers and multilayers of pure elements or alloys are interesting in several respects: magnetic prop- 
erties, mechanical and elastic anomalous behaviour f23[| . enhanced catalytic properties J24J. A great deal of attention 
has been devoted in recent years to magnetic properties of metallic thin layers and multilayers. Among them, Giant 
Magneto- Resistance and Perpendicular Magnetic Anisotropy may lead to technological improvement of the magnetic 
recording techniques. The possibility of a Perpendicular Magnetic Anisotropy as a consequence of the reduced sym- 
metry, was first pointed out by Neel. Strong experimental evidence for this anisotropy was found by Gradmann's 
group p5| . In multilayers, composed of magnetic and non-magnetic metals, interface anisotropy is expected, too p6| . 
The magneto-elastic effect, which involves the same fundamental energy, i.e. the spin-orbit coupling, may be the 
leading factor |27|]. Strong magneto-crystalline anisotropy has also been observed in metallic alloys layers grown by 
sputtering or Molecular Beam Epitaxy when an anisotropic structure is obtained with the easy magnetic axis perpen- 
dicular to the layer (hexagonal, tetragonal, etc.) fl28| , [29f . It has also been shown that antiferromagnetic coupling may 
occur between magnetic layers through non-magnetic layers, and the magnitude of this effect oscillates with the non- 
magnetic layer thickness |p0| . This magnetic coupling gives rise to Giant Magneto- Resistance |31j. The occurrence 
and magnitude of these properties depend dramatically on the quality (roughness, etc.) of the layer. The instability 
of the growing front may alter this quality. 

C. III-V compounds 

In the remainder of this chapter, attention will be focussed on III-V semiconductors which have been most ex- 
tensively studied because of their technological interest. In this case, the substrate temperature and the V-III ratio 
are chosen in such a way that the growth rate is controlled by the group-Ill element incorporation rate (close to 
one), the group-V element being incorporated only if chemically reacting with group Ill-elements. Thus, increasing 
the group-Ill element deposition rate increases the growth velocity. On the other hand an increase of the V-element 
beam rate decreases the surface diffusion constant. In this case, the choice of a particular V/III ratio associated 
with a particular substrate temperature corresponds to the choice of particular growth kinetics. Since the group V 
element evaporates when it is not at once incorporated, the MBE growth of III-V compounds can be described with a 
reasonable approximation by models in which a atoms of a single species are considered [22 3^-|34|. This species is the 



group III element, whose deposition and diffusion on the surface determines the growth rate and the type of growth. 
Such models will be used in the following chapters and are applicable to GaAs, for instance, if the word 'atom' is 
everywhere replaced by 'Ga atom'. However, one has to be careful when using this approximation. For instance, as 



noted by Tersoff et al. 1 35 1 the temperature dependence of the Ga adatom density po at equilibrium is not given, as 



it would be in a one-component system, by 

PQ = a- 2 e-? w <> (3) 

where a 2 is the area per atom and Wq a constant. The reason is the following: terraces contain an equal number of 
Ga and As atoms, while Ga adatoms move presumably alone, without the company of an As atom. Therefore, if a 
Ga adatom is released from a terrace, a consequence is (apart from the energy cost) the release of an As atom from 
the same terrace into the vapour. Therefore, there is an entropy gain which modifies Wq by a temperature-dependent 
amount. Treating the As vapour as an ideal gas, Tersoff et al. |35j obtain 

Po=a- 2 (P/P r 1/m e- l3W ° (4) 

where Wq is a constant, P is the As pressure, m = 2 for the diatomic vapour As2, and B 



Pa{T) = i-^tr- (5) 

where M is the mass of the molecules of the vapour (150 times the proton mass in the case of AS2). Thus, if (0) 
is replaced by (||) in a single component model. Wo depends on temperature. For the values used by Tersoff et al. 
(T = 850 K and P = 10~ 4 Pascals) formula (|) yields (P/P ) _1/m ~ 10 9 , a remarkably large value. If, instead of 
being at equilibrium, the system is subject to MBE growth, an effective pressure can be defined |$5| as "that pressure 
which would give the same As desorption rate in equilibrium as actually occurs" . Experimentally, Tersoff et al. find 
a factor 10 15 unfortunately far from the above mentioned factor 10 9 . 



D. MBE, why? 

The merit of MBE is the possibility to prepare a wide range of III-V semiconductor materials^ with tailor-made 
lattice constants or electronic properties. For instance, in a ternary compound A^Bi-xC, the choice of x allows for 
tuning the lattice constant so as to make possible epitaxy on a given substrate. Instead of the lattice constant, one 
can tune the electronic band-gap in order to obtain the desired optic or electronic properties. Examples are provided 
by the ternary compounds with A=As, B=P and C=Ga. Pure GaAs (x = 1) has a direct band gap in the infrared, 
while GaP an indirect band gap in the green, and their mixture GaAso.6Po.4 has a direct band gap emitting in the red. 
This film growth technique provides thus a flexible and powerful tool to select a particular wavelength by choosing 
the alloy composition (see Fig. 0) and/or to engineer the band structure by growing layers of predetermined and well 
controlled alloy composition. 

These materials (binary compounds or alloys) can be combined in single or multilayer 'heterostructures' made of 
different semi-conductors which have different electronic properties (e.g. band gap energy) but closely matching lattice 
parameters. 

The simplest heterostructures are quantum wells, which are used in optoelectronics and constitute the active zone of 
optoelectronical devices (lasers or optical amplifiers). In a quantum well, the charge carriers (electrons and holes) are 
confined between two planes (see Fig. Bk). Examples of materials appropriate to make a quantum well are Gai-^In^As 
and Ali-aJn^As. These two alloys are lattice-matched on an InP substrate for x = 0.53 and x = 0.52, respectively, 
but their band gap energy varies from 0.76eV (1.65/im) for GalnAs to 1.46eV (0.85/im) for AlInAs. 

These quantum wells themselves can be organized in complex superlattices. It should be noted that band dis- 
continuity resulting from the alignment of different band structures can induce another kind of interesting carrier 
confinement: the junction often includes a step and a notch in the valence or the conduction bands. A typical exam- 
ple is the heretojunction between highly n-type AlGaAs and lightly doped GaAs (Fig. pp). For most technological 
applications, the important point is to make perfect interfaces between materials free of structural or chemical defects. 



However, it is actually a difficult task because of the surface segregation 1 36 of the binary with the lower surface 
energy which acts against the chemical abruptness of the interface. 

E. Instabilities 

MBE allows to make chemically unstable alloys. This challenge to chemistry is not free from any danger. The 
most usual danger is a 'chemical instability', i.e. composition modulations (alternating AC rich and BC rich domains) 
appear during the growth, either parallel or perpendicular to the growth plane. As an example, Fig. R shows the 
surface of unstable alloys of composition close to Alo.5Ino.5As, which are unstable and tend to decompose into two 
phases of composition close to AlAs and InAs. 

From a technological point of view, a chemical instability leads to a non-uniformity of the band-gap and thus induces 
an undesirable broadening of the photoluminescence line width and a lowering of the electron mobility. But, on the 
other hand, the chemical instability of the alloy can be used to obtain spontaneous nanoscale strained structures (wires 
or dots) if correctly controlled. The deviation from randomness can also take the form of an ordered phase. Both 
composition modulation (Fig. |7|) and long range ordering (Fig. g), have been observed and sometimes they coexist in 
the same sample. 

Once it has been grown, the bulk material can survive during centuries at room temperature, because bulk diffusion 
is very slow. In contrast, surface diffusion is pretty fast at the growth temperature, and must be so to ensure a good 
crystal quality. 

Even in the absence of any chemical instability, a multilayer or a double layer can be unstable if the lattice constants 
of the pure materials are too different. When a semi-conductor is grown lattice-mismatched with the substrate, a 
thin pseudomorphic layer is expected to grow first uniformly and commensurably with the strain energy increasing 
linearly with thickness up to a critical threshold. Below this critical threshold which drastically depends on the growth 
conditions, a compressed or stretched pseudomorphic layer is obtained, with a modified band structure which can 
be useful for a wide range of practical applications. Beyond the threshold, dislocations or other types of instabilities 
usually appear. 



1 II-VI semiconductors can also be prepared and are currently studied in laboratories, but their industrial use is not so wide 
as that of III-V materials. 



While chemical instabilities will not be addressed in the remainder of this review, elastic instabilities will be studied 



in detail in chapters VII] and following 



IV. THE EHRLICH-SCHWOEBEL INSTABILITY ON A HIGH SYMMETRY SURFACE 

A. Introduction 

In the present chapter and in the next one we will give a description of kinetic instabilities occurring in homoepitaxial 
growth, due to the so-called Ehrlich-Schwoebel (ES) effect which makes the sticking process of an adatom to a step 
asymmetric. This effect was discovered thirty years ago by Ehrlich and Hudda p7[ , who observed —through a field ion 
microscopy technique— that tungsten atoms diffusing on a terrace were repelled by a descending step. It was shown 
by Schwoebel and Shipsey |38| that this effect is stabilizing on a vicinal surface, while its destabilizing character for 
a high symmetry surface was pointed out by Villain only a few years ago ]39| ]. 

For a quantitative analysis of this mechanism, microscopic details are extremely important: lattice structure, surface 
orientation, step orientation, sticking process. For example, Stoltze Q has performed a detailed study of the surface 
energetics of several fee metals, based on the effective medium theory. According to this work, an ES barrier exists for 
any surface and step orientation. The extra energy barrier is of the order of 100/300 meV (a fraction of the diffusion 
barrier on a flat surface) and it depends on the sticking process, which may take place through a hopping over the 
edge or through an exchange mechanism with an edge atom. The former is almost always favorable for the (100) and 
(111) surface orientations, while the latter prevails in several cases for the (110) orientation. 

Ab initio calculations show that the 'real' picture may be even more complicated. For Al(lll) —which has an 
extremely low diffusion barrier: 40 meV— Stumpf and Schcfflcr |41|] have shown that the exchange process is favorable, 
but the extra energy barrier is very small, and perhaps smaller than the 'long-range' attraction energy between step 
and adatom. For Au(lll) p2] the static barriers (at T — OK) are fairly large and almost equal for the two mechanisms, 
but simulations at T > 450K show that exchange does not require an extra energy. 

The experimental picture is not universal. Field ion microscopy studies by Ehrlich and collaborators (see the 
paper |43j by Kyuno and Ehrlich and references therein) show that an adatom approaching a descending step may 
feel a step-edge trap rather than a step-edge barrier, which may reduce the diffusion bias (the up-hill current induced 
by the ES effect: see below). 

Several authors |44J have proposed and/or performed an indirect evaluation of the additional step-edge barrier 
through the study of the nucleation rate on top of islands. The method is applied to metal epitaxy (Fe, Ag, Pt) and 
the results generally attest the existence of such a barrier. For instance, Morgenstern et al. p5J find a Schwoebel 
barrier equal to 0.13 ± 0.04 eV for Ag(lll). 

In contrast, the existence and the importance of step-edge barriers in semiconductor materials is a much more 
debated question. These barriers have been invoked to explain the roughening of Si(100) Eq] before the crystal- 
amorphous transition; on the other side a recent and detailed study |47| of the growth of Si(lll) shows a morphology 
which does not agree with the unstable evolution due to ES barriers. Nevertheless, some calculations |4j| based on 
empirical potentials J49[ do give a finite barrier even for the (111) orientation. 

A surface evolution in agreement with step-edge barriers is found for Ge(001) [pfJMj, even if the barriers are 



evaluated to be fairly weak. Also for GaAs there is some evidence [|32|,|52 53 supporting their existence. 

The efficiency of the ES barrier clearly depends on the temperature: the ES instability —which hinders the attain- 
ment of the equilibrium— manifests itself at low T . 

The name Ehrlich-Schwoebel instability is generally used in relation to growth of a high symmetry surface: in 
this case it gives rise to the formation of three-dimensional structures rather than to a layer-by-layer growth. This 
instability is studied in the present chapter. 

On a vicinal surface, where growth proceeds via step flow, the same microscopic mechanism (the ES effect) de- 
termines meandering of the steps, usually referred to as Bales-Zangwill (BZ) instability p4J], which resembles the 
fingering instabilities discussed in chapter 0. It will be studied in the next chapter. 

B. Continuum description of the surface 

The evolution of the surface shape may be attributed to two types of mechanisms, i) Deterministic ones, which 
tend for instance to reduce the free energy and to restore thermal equilibrium, ii) Stochastic processes related to the 
randomness of the deposition, of atom diffusion, of nucleation. . . . The stochastic part is usually called 'noise', in 
analogy with noise in electric circuits for instance, which is caused by a stochastic motion of electrons. Thus, if z[x, t) 



denotes the local height of the surface profile in x = (x,y), its temporal variation is expected to satisfy an equation 
of the form: 

d t z(x, t) = „4({z}) + noise (6) 

where „4({z}) is a deterministic functional of z(x, t) and dt = d/dt. 

If volume diffusion is neglected, the dynamics of the surface is governed by three microscopic processes: deposition, 
surface diffusion and desorption. By definition, surface diffusion conserves matter on the surface; deposition from the 
vapour phase is the source of new matter, but generally it is a process which does not depend on the surface profile: 
it contributes to the local velocity dtz through a term -F(af, t) = Fq + 5F(x, t). We can get rid of the average value Fq 
by simply redefining the local height: z(x,t) — > (z(x,t) — Fot), while SF(x,t) contributes to the noise. In some cases 
the deposition rate does depend on the surface profile, for example when the average flux impinges sideways upon the 



surface. This may give rise to shadowing instabilities, which will be discussed in Chap. VI. Another possibility is a 
rate of attachment of the impinging atom which depends on the local environment. 

Finally, desorption can be important, especially in semiconductors, whose typical growth temperatures are higher 
than for metals. The description of a high symmetry surface will be worked out mainly under assumption that 
evaporation is negligible. 

Once that matter is conserved on the surface, the next step is to assume that overhangs, voids and intcrstitials are 
not relevant, so that volume as well is conserved. If it is so, the functional .A({z}) must be the divergence of the 
surface current and therefore: 

dtz(x, t) = — V • j + noise (7) 

where the current j is a functional of the surface profile z(x,t), similarly to „4({z}) in Eq. (0). An important 
question is whether j is a local or nonlocal functional. In other words, is it sufficient to know the surface profile in x 
{z(x, t), Vz, V 2 z, . . .) to determine the growth velocity dtz, or is it determined by the surface profile in a finite space 
around x ? In principle, epitaxial growth is not a local process, mainly because diffusion is not. In this sense, Eq. (M) 
must be understood as an equation which is valid on large space and time scales: space scales larger than the typical 
distance between steps and time scales larger than the filling time of one monolayer (= \/Fq). Nevertheless, even so 
a nonlocal description may be necessary: we will come across one case later on. 

Because of the translational invariance in the z-direction, the current j as well as the quantity A in the general 
case of non conserved volume, does not depend explicitly on z(x,t), but only on its derivatives. So, it may be useful 
to think of j (or A) as a functional of the local slope m — Vz. By taking the gradient of (fn), we obtain: 

d t m = -V(V • J) + V(noisc) (8) 

= -V 2 J - V A (V A J) + V(noise) (9) 

In 1+1 dimensions the current and the slopes are scalar quantities and therefore the curl term in the right-hand-side 
disappears (d x = d/dx): 

d t m(x, t) = -d 2 x j + 9 X (noise) (1 + I dim) (10) 

If the current is derivable from some effective free energy T 

jm = - s £ (id 

the evolution equation for m may take or not the form of the Cahn-Hilliard |55|j equation, depending on T . 

In 2+1 dimensions the curl term vanishes if the current is nonrotational: V A j = 0. Since j is a functional of rh 
and V A rh = V A (Vz) = 0, the current is expected to be nonrotational only if it is linear in the slope, but this is 
generally true only in the earliest stages of growth, when \rh\ <§C 1. Therefore V A j ^ as a rul e. All t he same, the 



evolution of the surface may have many similarities with a phase separation process (see chapter IVF 2). 



C. Different sources of noise 

Till now the possible sources of noise have not been indicated explicitly. In Molecular Beam Epitaxy there are three: 
i) Fluctuations in the incoming flux (the so-called shot noise); ii) Fluctuations in the diffusion current (conserved noise); 
iii) Fluctuations in the nucleation process of new islands. 



The first two have been well studied in the context of several different models pLpf : generally speaking, their effect 
is to roughen a surface which would be flat in the absence of noise. Here the typical situation we want to study 
is different: the surface is deterministically unstable and therefore the 'deterministic roughness' may overcome the 
stochastic one, induced by noise. Furthermore, if the dynamics resembles a phase separation process, noise is known 
to be an irrelevant parameter, at least in 2+1 dimensions. A second remark on shot noise is in order: roughness 
induced by shot noise is in part healed by surface diffusion. It is therefore important to understand on which time 
and length scales roughness is relevant. Here we report a qualitative argument due to Villain [p9|], which will also be 
used in the context of unstable growth of a high symmetry surface (see App. KJ). 

The average number of atoms falling on a region of size £ during time t is: N — Fot£ and its fluctuation due 



to a Gaussian shot noise is: AN « VJY = ^/Fot£ d . Such 'volume' fluctuations determine a 'height' fluctuation 
Ah = AN/£ d ss y / h/d d , where h = F t is the average number of deposited layers. The central question is: What is 
£ ? In the context of a surface which is deterministically stable, £ is the diffusion length £jj (see App. |b|): diffusion is 
supposed to heal shot noise just on this scale. If £d ~ 10 2 -j- 10 3 , in two dimensions we will have Ah « 1 after the 
deposition of 10 4 -i- 10 6 layers, which is quite a large value. 

The contributions of shot noise and diffusion noise to Eq. (fa) are generally written as 5F(x, t) and t]d(x, t) respec- 
tively, with: 

(8F(x, £)) = (5F(x, t)5F{x', if)) = 2F S(x - x')5(t - t') (12) 

{ VD (x,t))=0 {i lD (x,t) VD (x\t l )) = -2DV 2 5{x-x')8(t-t l ) (13) 

where D is the surface diffusion constant. 

Concerning nucleation, it is due to an encounter of two (or more) adatoms during the diffusion process. So, we 
might ask why nucleation noise is something different from diffusion noise. It is indeed possible to separate the two 
contributions in the following way: on a region of high slope, nucleation is almost negligible (see App. 0), but diffusion 
noise still exists; conversely, in the continuum limit a^Owc will see that diffusion noise is negligible while nucleation 
noise survives. 

A further feature which characterizes nucleation noise with respect to shot and diffusion noise is that for the latter 
ones it is clear the meaning of 'absence of noise'. In other words, it is obvious which is the corresponding deterministic 
model: 6F(x,i) = and r]D(x,t) = 0. But which is the deterministic model of nucleation? We will show that even if 
a model can be proposed, it may give rise to a nonanalytic profile in the nucleation sites. 

An important question is whether nucleation noise is enough to roughen the surface by itself. The answer, according 
to the works by Elkinani and Villain |36| and Wolf et al. (57]], is 'no'. Even more important, there is numerical 
evidence p8[ that nucleation noise is able to heal surface defects at small length scales. However, a deeper analytical 
comprehension of nucleation noise would be necessary. Preliminary studies will be found in Refs. [57| , p^| . 

D. The surface current j 

This chapter addresses a problem which is common to several branches of physics: How to derive a continuum 
equation, valid at large length scales, starting from a microscopic and discrete point of view? Here the situation is 
particularly difficult because the surface is made up of two different microscopic objects —steps and adatoms— whose 
characteristic time scales are fairly different: if T a d is the typical diffusion time to a neighbouring site and T s t ep is the 
time necessary to advance the step of one lattice constant, it is easily found that r a d ~ a 2 /D and T s t ep ~ l/(Fo£a d ~ 1 ), 

where £ is the typical distance between steps. The condition T step ^ r a d reads (we put a = 1) £ <C (D/ Fq) = £^ , 
with 1/(5 > 2 (see App. |b|). Since £ must be smaller than £o and £u ^> 1, the above condition is certainly fulfilled. 
Therefore adatoms see stationary steps during their motion and steps feel adatoms only as an 'average' current: the 
Burton, Cabrera and Frank theory |6(J] of crystal growth is indeed based on this quasi-stationary hypothesis. 

A possible approach to the problem of determining a continuum Langevin equation is to start from the microscopic 
rates for the single processes (deposition, diffusion, evaporation ...) in a solid-on-solid model and write down a master 
equation for the discrete variables hi (the height of the i-th column in the SOS model). This procedure generally 
requires the truncation of an infinite hierarchy of equations to arrive to a set of discrete Langevin equations, one for 
each h^ afterwards, it is necessary to find a continuum limit in the space variables and this requires a regularization 
scheme for step functions. This method has been used by several authors: Plischke et al. pit] for Metropolis dynamics, 
Zangwill et al. fJ3[ , Vvedensky et al. |32| and more recently by Pfedota and Kotrla Q] for Arrhenius dynamics. 

As pointed out by the same authors this approach suffers from important problems, the major one regarding the 
regularization scheme for the step function: no rigorous procedure exists, different schemes give different results and 
in the final results some coefficients entering in the regularization procedure which should depend on the flux Fq 



remain undetermined. A critical discussion of the master equation approach in the case of equilibrium relaxational 
dynamics can be found in Rcf. |64| . 

A completely different approach uses the method of Burton, Cabrera and Frank |6fl]: one writes the diffusion 
equation for adatoms on a given terrace and solves it with suitable boundary conditions at steps: this gives rise to 
a current of adatoms, from which step velocities can be evaluated. Once they are known, one can study numerically 
the evolution of the surface or one can try to obtain a continuum evolution equation through some 'coarse graining' 
procedure. This method is generally restricted to sufficiently large slopes: in fact it is not easy to introduce nucleation 
in this kind of approach. 

Instead of giving further details on the different methods which can be employed to obtain some expression for j, 
we prefer to introduce the different terms phenomenologically and justify them. 

1. The Ehrlich-Schwoebel current 

The first place clearly deserves the term which is responsible for the instability treated in the present chapter: 
the current induced by the Ehrlich-Schwoebel effect. Let us imagine a surface which has a pyramidal structure and 
therefore top, bottom and vicinal terraces (see Fig. [lC]). 

Such a 'three levels' structure exists even in the absence of any step-edge barrier (see, for example, the simulations 
reported in Fig. 7a of Ref. |56[ ). If an ES barrier exists, it has a double effect: i) on a top terrace, since atoms 
are hindered from descending, the probability of nucleation is increased and this makes pyramids higher; ii) on a 
vicinal terrace, adatoms prefer sticking to the ascending step and therefore existing holes are not filled. Both effects 
destabilize the flat surface, giving rise to three-dimensional growth. From a continuum point of view, the above 
mechanisms give rise to a nonequilibrium current jes which depends on the slope of the surface. This is of extreme 
importance, because all the other terms in the current will be seen to vanish on a surface of constant slope. 

Let us start by considering a 1+1 dimensional surface, which is supposed to have a positive slope. On a terrace 
of size £ we solve the diffusion equation: dtp = F$ + Dd^p in the quasi-static approximation (dtp — 0), which 
is justified by the relation T step ^> T a d- This gives p(x) = A + Bx — (Fq/2D)x 2 , where constants A and B are 
determined through boundary conditions at steps in x = ±£/2. Here, the sticking coefficients k + and fc_ (respectively 
from the upper and lower terrace) are introduced: an infinite coefficient means that the adatom is automatically 
incorporated as it reaches the step, while a vanishing one means the adatom is unable to stick. The existence of a 
current jes is due to an asymmetry between k + and £;_: in fact, such an asymmetry makes asymmetric the density 
of adatoms with respect to the center x = of the terrace, i.e. B ^0. Since the microscopic current on the terrace 
is: jmic = ~Dd x p = -D[B - (F /D)x], its average value is simply: j E s = (jmic)ter = —DB. 

If the equilibrium density is neglected with respect to the density due to the external flux, boundary conditions 
read: Dp'(±£/2) = ±k^p(±£/2). In the simplifying hypothesis that &_ = oo we can introduce a single parameter, 
the so-called ES length: £es — D/k+, and the boundary conditions read p(£/2) = for the ascending step and 
p'{-£/2) = -p(-£/2)/£ E s for the descending one. Under these conditions it is found that B = -F £es£/[2D(£ E S+^)] 
and therefore n 

F Q £ E sm 
JES 2(l + |m|^s)|m| l } 

The previous relation can be easily read in the two limiting cases of a strong ES effect (£es 3> £) and a weak one 
(£es "C £)■ In the former case all the atoms falling on the terrace contribute to the current and therefore jes is 
proportional to £ and independent of £es] m the latter one, only the atoms falling within a distance £es from the 
ascending step contribute to the current and Jes is now proportional to £es- 

Eq. ( O ) is clearly not applicable in the limit m — > 0, because the relation \m\ = l/£ is valid as far as no nucleation 
takes place on the terrace itself. The simplest way to obtain the correct expression for any value of m is to observe 
that the current must vanish on the high symmetry orientation m = 0; so, it is expected to be linear at small 
slopes: jes = orni and the constant a is determined by matching the two expressions for \m\ ~ 1/£d- This gives 
a = Fq£es£d/[2(1 + £es/£d)]- The simplest function interpolating between the two limiting expressions is: 

F £ E s£Dm 
3ES = 7777— i ,„_ i i^i,__ui i ■ i^_^ ( 15 ) 



2(1 + £es/£d + \m\£ ES )(l + \m\£ D ) 



In this chapter and in the following, z is the height divided by the monolayer thickness c, so that the slope m has the 
dimension of the reciprocal of a length. 



which reads, for weak and strong step-edge barriers: 



F £ E siDm o^-o nn\ 

JES = 2(1 + HfeXi + MW ES<< D (6) 

^- ipffe" fe>>fe (17) 



A more rigorous derivation oi jes at small slopes is given in Rcf. |58|: for |m| < 1/£d the surface is made up of a 
'sequence' of top, bottom and vicinal terraces of typical width £jj , whose average slope is just m: since vicinal terraces 
are the only ones to contribute both to the (average) slope and to the ES current, and both quantities are linear in 
the number of vicinal terraces in the 'sequence' under study, we conclude that the ES current and the slope must be 
proportional. This method gives the same expression as before for the constant a. 

Before going on, let us comment on different methods for determining jes- In the spirit of the procedure just 
explained, we could modify the starting differential equation for the density p in order to take into account nucleation 
at that level. This procedure has been used by Myers-Beaghton and Vvedensky [p4| , who have introduced a term 
proportional to p 2 to simulate the formation of a dimer. In this case the diffusion equation reads: Fo + Dp"{x) — 
sp 2 (x) — 0. This equation is not solvable in closed form, but in the limiting cases £ — > and £ — > oo it displays |6g] 
the same behaviour as before. In the limit £ — ► this is obvious, since in such a limit the quadratic term is negligible. 

We want to emphasize that two different lengths are relevant for the problem: the ES length Ies an d the diffusion 
length £e- The latter one is generally defined as the typical distance between nucleation centers on a high symmetry 
surface (see App. |b|), and the value of Ies does not enter in Ip. Nevertheless, when nucleation can take place on 
vicinal and/or top terraces, the ES barrier affects the adatom density and therefore the typical size of a vicinal or top 
terrace. It is possible f5q , p6|l to introduce a 'nucleation' length £ n for a vicinal terrace (£%) and for a top one (i^) (for 
a bottom one we simply have £^ — Id)- a new terrace nucleates on an old one when I becomes of the order of £ n . 
When £es — 0, £\ = £^ = £d and for a finite ES effect the nucleation lengths decrease (because the adatom density 
increases): £\ has a weak dependence on £es and £\(£es = oo) = £d/V% Q, while (^ goes to zero when £es — * oo 
(see Fig. |4]). Strictly speaking, the quantity Id in Eq. (|l^) should be replaced by £%, but we have just seen that 
£^ — £d for any value of £es- Because of this, the nucleation term (proportional to p 2 (x)) in the diffusion equation 
can be neglected if £ <C £d whatever is the value of £es'- in this limit, Eq. (H) should be recovered. 

Finally, we will mention the previously discussed 'microscopic' approach which makes use of the master equation: 



in the presence of step-edge barriers it gives 62 an ES current proportional to the slope, but no expression beyond 
the linear approximation is known so far. Nevertheless the expression for the proportionality coefficient a found by 
Vvedensky et al. [|62| has the merit to clearly show that the ES current vanishes when detailed balance is satisfied. 
Our expression (|15|) trivially satisfies the detailed balance: since thermal detachment from step has been neglected 
there, it is sufficient that jes vanishes for Fo = 0. 

The expression we have found for jes has two main limits: it is valid in 1+1 dimensions and it does not take into 
account possible discrete effects in the z direction. For example, if we are still in one dimension and m = means the 
high symmetry (10) orientation of a square lattice, then m = 1/a necessarily means the (11) orientation, which is also 
of high symmetry. Does the ES current vanish as well on this orientation? If so, Eq. (^5|) is wrong because it onl; 
vanishes for m — 0, ±oo. The fact that jes (± 1/a) should vanish is not obvious since flux could break the symmetry. 

Concerning real, 2+1 dimensional surfaces, we can observe that here as well as for other terms in the current we must 
accept some reasonable generalizations of the expression found in 1+1 dimensions and nothing more. Microscopic 
approaches can not help here, because they give expressions at the lowest order (for example, the linear one for the 
ES current), in which case the generalization is generally trivial. 

Nevertheless, since the evolution of the surface depends only on some very general features of the current, we should 
not be worried too much about this. We will see in the following that the only relevant features of Jes are: i) A 
linear behaviour at small slopes: Jes = &rn, with a positive a, so that the surface will be linearly unstable, ii) The 
existence or the absence of finite zeros in the current, iii) Possibly, the behaviour of the current at large slopes, if 
such zeros do not exist, iv) The in-plane symmetry of the current. 

The simplest expression for the ES current in the absence of the above mentioned zeros is the one proposed by 
Johnson et al. |53l: 



3 Siegert and Plischke (see note [11] of Ref. J67|]) have pointed out that it should not be relevant. Indeed, the problem is 
equivalent to have an incoming flux forming an angle of 45° with the surface (10): the ES current vanishes anyhow, since 
shadowing effects are completely negligible. 



-. am 

JES = TTm% (18) 

We observe that this expression may be valid only for £es ^S> Id- To take into account discrete effects in the z- 
direction and therefore possible zeros in the current, Siegert and Plischkc |37J have replaced m with m/(l — m 2 a 2 ): 
this corresponds to the substitution of tan# = ma by (1/2) tan 28 = t&n9/(T~~ tan 2 ff), 
i.e. a function of period 7r by a function of period w/2. The resulting current is |p7| : 

- am(l-a 2 m 2 ) _ 2 

3*s = (1 _ Q2m2)2 + m2£ 2 D = rnf(m ) (19) 

The previous expression vanishes on the 'circle' \m\ = l/a: there is a continuous in-plane symmetry. Discrete effects 
in the plane should break this symmetry. For example, for a cubic lattice we could write p7| , p8[ : 

(JEs)x,y = mx, y f{m 2 xy ) (20) 

Now, the number of zeros is finite and equal to four: m = (±1, ±l)/a. On the basis of the 'relevant features' of Jes 
discussed above, Eqs. (|l^) and ( [20| ) can be replaced by the following simpler expressions: 

Jes = oefh{l — am) Same qualitative properties as (19) (21) 

(JEs)x,y = cnm x ,y(^ — a2m x .y) Same qualitative properties as ( f20| ) (22) 



A model with jes given by Eq. (|2lj) has been studied by Stroscio et al. [|69| . 

To conclude this part, we can wonder whether some other mechanisms can contribute to a slope dependent current. 
It is indeed so: generally these mechanisms are not thermally activated and are due —on one side— to the release of 
the condensation heat when the adatom lands on the surface, and —on the other hand— to the fact that the adatom 
does not necessarily land on a hollow site. These processes are called (see Ref. [|70|) downward funneling, knock-out, 
transient mobility, and they are illustrated in Fig. |llj. 

Their effect is a down-hill current which depends on the density of steps and therefore on the slope of the surface. 
In the simplest picture they give rise to a current Jnt — —am which must be added to jes- Now, two possibilities 
exist: if a > J'es(®) the total current (jnt + Jes) is downhill and no instability takes place; conversely, if a < J'es(®) 
the total current is uphill at small slopes but downhill at large slopes: therefore it vanishes at a finite slope mp even 
if Jes does not! For example, if jes is given by the one dimensional version of Eq. ( |l8| ) then mo = \fjaja) — 1/£d- 

A final remark is in order: The current Jmt and therefore a depend on the crystal structure, but there is no reason 
to think that the slope mo at which it counterbalances Jes nas some special value. 

2. Mullins-type current 

Let us consider a grooved surface in the absence of any external flux, so that it is close to equilibrium. Furthermore, 
we suppose that the temperature is sufficiently low so that evaporation is negligible and the surface relaxes to 
equilibrium through surface diffusion. This problem is a 'classical' one (see the recent review [£lj) and for a non 
singular surface it has been solved forty years ago by Mullins [J72[: atoms thermally detach from steps and feel a 
surface chemical potential which is larger in the region of negative curvature. This determines a current from the 
top to the bottom of grooves, which relaxes the surface. More precisely, we have a current: jm — -TV/i where T is 
a kinetic coefficient. The expression given by Mullins is T — f3poD where po is the equilibrium adatom density and 
f3 = l/(fcsT), where ks is the Boltzmann constant, set equal to one in this chapter and in the following one. The 
chemical potential /i is the variational derivative of the surface free energyn: ii = S£ /6z, where £ has the general 
expression: 

dxdyip(z x ,z y ) with ip = a(\/z)<J 1 + c 2 (z 2 + z 2 ) (23) 



4 The correct definition fl4j is that fi is the Gibbs free energy per particle. The pressure will be assumed to be low enough 
to allow the replacement of the Gibbs free energy by the Helmholtz free energy. In the present chapter, the solid is assumed 
incompressible and the free energy reduces to the surface free energy £ , apart from a constant part, so that /i = S£/8z. In 
other chapters the elastic bulk energy will be taken into account. 



where z x>y = d x ,yZ, c is the out-of-plane lattice constant and ct(Vz) is the (slope-dependent) surface tension. 

In the limit of small slopes (|Vz| <C 1/a) and for in-plane isotropy, ip ~ (c 2 a/2)(z 2 + z 2 ) and /i is simply proportional 
to the curvature of the surface: /i = — a 2 c 2 aV 2 z, a — (er(0) + c~ 2 cr"(0)) being the surface stiffness. Finally, the current 
reads 

j M = KV(V 2 z) = KV 2 m K = ra 2 c 2 a = [3D{p a 2 ){ac 2 ) (24) 

where we have used the fact that V A m = 0. 

If we use jm as the surface current inside Eq. (Q), we easily find that a sinusoidal profile of wavelength A relaxes to 
the flat surface in a time of the order of A 4 . This current, as previously mentioned, has a stabilizing character. 

After this digression on the current for a surface which is i) non singular and ii) close to equilibrium, we can ask 
what all this has got to do with a growing surface, which also may be a high symmetry one. A first answer is that 
flux is just what makes Mullins' theory applicable even to a singular surface: in fact, the surface is roughened by 
the flux and therefore behaves as a non singular one, even if T is lower than the roughening transition temperature. 
Nonetheless, the basic mechanism of Mullins' current is thermal detachment from steps: a process which may be 
hindered by the flux, if the typical time for detachment is longer than the typical time (1/Foa 2 ) to fill in one layer. 
For example, Stroscio and Pierce ]73| assert that thermal detachment is negligible at room temperature during the 
homoepitaxial growth of Fe. Because of this, in Ref. f39J they replace Jm with a current of the form j — KV 2 V 2 rh. 
As we shall see, in the presence of a flux the thermal detachment mechanism can be dominated by other ones. 

In Ref. [ [74[ two of us have tried to study directly the growing, far from equilibrium surface. We started from the 
assumption that the relation j ne = — T ne V/i ne is still valid, where the subscript ne stands for 'non equilibrium'. \x ne 
must be read as a fictitious 'step chemical potential': steps at distance (. exchange atoms because of the existing 
gradient of /i: 5/J, ne /£. The result is T ne = pQ/3a 2 ~ d , where po is the emission rate for step site. The pseudo surface 
free energy £ giving rise to p ne is due both to step energy and to entropy. The result is that Eq. (p3) is still valid at 
small slopes, with K = K t her given by 174 ]: 

K t her^Poa 2 lD 1 + 1 dim (25) 

Kther = PoPa 2 j£ D 2+1 dim 

where 7 is the step stiffness (equal to the step free energy per unit length in the simplest case, when it is isotropic). 

At larger slopes the picture is more complicated because £ is no more quadratic in Vz. For example, in 1+1 
dimensions fi ne is not proportional to the curvature z", but rather to (z" /\z'\): the same result has been obtained by 
Krug et al. [64| for a solid on solid model, close to equilibrium. 

The main result of this part is that a current of the form ( |24| ) is expected even if the surface is strongly out of 
equilibrium, provided that thermal detachment is allowed (i.e. po 7^ 0). We want now to suggest that a Mullins-like 
current is expected even if thermal detachment is forbidden: in this case, Jm is due to nucleation and diffusion noise; 
Let us start with the latter one. 

Randomness in diffusion means that the number of adatoms reaching the step of a terrace of size I per unit time 
fluctuates around the average value (equal to F$£/2 in the absence of step-edge barriers). Nonetheless adatoms feel the 
curvature only close to a step and once a freshly landed adatom has approached a step it can no more be distinguished 
from a freshly detached atom from the step: therefore the contribution of diffusion noise to the kinetic coefficient K 
is entirely similar to K t her provided that po is replaced by (Fq£/2): 

,2«2 



K dif = F a 2 r D /2 1+1 dim (26) 

K dif = F a 3 £ 2 D l3j 2+1 dim 

The discussion of nucleation noise starts from the analysis of Eq. (R) in the linear approximation: 

3(Z=-V'(am + iCV 2 m) (27) 

where the first term in brackets is due to the ES effect. The solution is z(x,t) — zoexp(— iq ■ x + wt), with uj = 
u)(q) = aq 2 — Kq 4 . The flat surface is therefore unstable (w > 0) if q < q* = \J ajK and the most unstable 
mode (which maximizes uj(q)) corresponds to q u — y/a/2K. In other words, the Mullins-like current stabilizes the 
surface on scales smaller than A* = 2n/q* — 2iry / K/a and the instability develops after a finite time of the order 
of t u = l/u>(q u ) ~ K/a 2 and with a typical wavelength X u = 2ir Jq„ — ^/2\* . This stabilization is evident in the 
homoepitaxial growth of GaAs(OOl) as reported by Orme et al. |32|: the substrate is rough at small length scales 
(< 0.1/xm) and the preexisting defects are healed before the ES instability develops on larger scales. In the case of 



Ge(OOl) (Ref. pOfl) another feature appears: at higher temperatures, larger thicknesses must be attained in order to 
observe the instability. This is easily explained by the expression of t u : at higher temperatures the ES barriers are 
less effective and therefore a decreases, whilst K increases (see Eqs. ( p5| , p^ ) and Eq. ( p8[ ) here below). 

In Ref. |5^] we have studied a one dimensional model of growing surface where the only source of noise is nucleation 
of new islands and thermal detachment from step is forbidden. In such a model, according to the above discussion 
the K-terva should be absent; nevertheless numerical simulations clearly show that a finite A„ does exist; even more 
importantly, X u does not exist if we adopt a (somewhat artificial) deterministic model for nucleation.fi These results 
prove that nucleation noise is also a source of a Mullins-like current: Which is the relevant value of K = K nuc in this 
case? If £es — 0, the only parameters entering our model are the intensity of the flux Fq and the diffusion length (.£>. 
Since [K] =length 4 ~ d /time, dimensional analysis suggests that 

Knuc « F e% (28) 

both in 1+1 and in 2+1 dimensions. 

In conclusion, we have found that in a far from equilibrium surface, thermal detachment, diffusion and nucleation 
noise all contribute to a Mullins-like current: 

K = Kther + Kdif + K nuc (29) 

In 2+1 dimensions, since po — (D/a 2 ) exp(— (3W a ) where W a is the energy to transform a kink atom into an adatom, 
we will have that: 

K nuc ->K ther if 4(/3 7 a)«a 3 exp(/3IK) (30) 

K nuc ^>K dlf if ^»(/3 7 a)a 2 (31) 

Since (3ja « 1, expression (pW confirms that in the continuum limit Id 3* o^^ the 'nucleation noise' contribution 
to K prevails over the 'diffusion noise' contribution. Conversely, the contribution due to thermal detachment will be 
negligible at sufficiently low temperatures: exp((3W a ) ~^> (£o/a) 3 . 

We conclude by pointing out the main limit of this part: ES barriers have been neglected in the evaluation of 
the various contributions to K . The reason is that K t her, Kdif and K nuc are present even in the limit £es = and 
therefore for a weak ES effect we can neglect it. Conversely it is not certainly guaranteed that the given expressions 
for K are still valid for a strong ES effect. For instance K t her must vanish in such a limit, because in general adatoms 
will stick only to the same step they have detached from and therefore there is no exchange of atoms between steps: 
Kther{^ES = °°) = 0- For the same reason we also expect that Kdif((-ES — °°) = 0. 

3. Symmetry breaking current 

According to the terms introduced up to now for the surface current j, Eq. ([7|) has two symmetry properties: 
in-plane reflection symmetry (x — » —x) and up-down symmetry (z — > —z). The general validity of the former depends 
on the in-plane symmetry of the surface. In the case of a three-fold symmetry, which is pertinent for epitaxial 
growth of fcc(lll) metals, such symmetry is absent and the relevant current pa] will be given in Eq. (p2|). Conversely, 
irrespective of crystal structure and surface orientation, there is no basic reason to expect that the up-down symmetry 
survives when the surface is subject to an external flux. 

The simplest expression for a current which changes sign with x but does not change sign with z is: j$B = VG(to 2 ), 
where G is any even function of the slope to and the subscript SB stands for symmetry breaking. Before proceeding 
we can make some general considerations on jsb- If we consider a sinusoidal profile: z(x) = ZQsm(q ■ x) it is 
straightforward to write down: 

3M = -Kq 2 rh (32) 

jsb = 2G'{m 2 )[d 2 xx z + d 2 yy z]m (33) 

The ES current (see Eqs. JTg|-p2"|)) has the same sign of the slope and for this reason it has a 'destabilizing' character. 
In contrast, the Mullins-like current jm is stabilizing; finally, the symmetry breaking current has a different behaviour 



In this model a new terrace is nucleated on an old one as soon as its length equals the nucleation length £ n (see App. M) 
The artificial character of this model determines a nonanalytical form of the profile in the nucleation sites. 



according to the sign of the curvature: [d 2 z + d 2 z). We will see that the physically relevant case corresponds to 

G'(m 2 ) > 0: this means that jsb is destabilizing (resp. stabilizing) in a region of positive (resp. negative) curvature 
and therefore the bottoms of a grooved profile are narrower than the tops and in some models they even look like 
crevasses. 

The first analytical derivation of jsb has been done by Politi and Villain |p8[ in the case of a 1+1 dimensional 
surface. Afterwards, this result has been confirmed by Krug with a different method j j75[ . The existence of a curvature- 
dependent term in the current which has not the form of jm is easily deduced by the following consideration: a surface 
profile whose slope is not constant is surely time-dependent, even if £es = and all the sources of the Mullins-like 
term (thermal detachment and noise) are switched off: in this case the velocity of each step —in the BCF theory- 
is proportional to half of the sum of the lengths of the upper and lower terraces. This means that dtz ^ even if 
Jes = hi = 0. 

A very simple, but not rigorous argument to justify jsb is now given: the 'microscopic' current on a terrace is 
jmic = —DV p. In the paragraph on the ES current we have evaluated jes as an average of j m ic on a terrace of size £: 
step-edge barriers make p asymmetric with respect to the center of the terrace and therefore jes 7^ 0. Nonetheless, 
even if £es = the density p varies from one terrace to another, if there is a curvature in the surface profile: in fact, 
the solution of the diffusion equation DS7 2 p + F = 0ona terrace of size I with p vanishing at steps gives an average 
value p ~ (Fq£ 2 /D). The terrace width £ is of of the order of £d at small slopes and of the order of l/|m| at high 



slopes. The simplest function interpolating between these two limiting behaviours |58|j75[| is: £ 2 « ^/(l + m 2 £ 2 D ). 
Therefore we obtain: 

^- BV (§I^%)=- v (lT^) < M » 

So, the function G(m 2 ) appears to be: 



f £ d 

1 + m 2 f 2 



G K) « - , dh-2 ( 35 ) 



D 

whose derivative G'(m 2 ) is always positive. At large slopes: jsb ~ — -FoV(1/to 2 ) which agrees —in its 1+1 dimensional 
version— with the results of Refs. J5q , |75fl , The numerical prefactor is (1/8) for Politi and Villain and (1/12) for Krug. 
It is important that both methods give a constant prefactor, not depending on the ES length. If the rough argument 
given above to justify jsb is slightly modified to take into account finite step-edge barriers, the function G appears 
to be f^g-dependent, but the dependence is extremely weak: the prefactor simply varies between (1/12) and (1/3) 
for £es varying from zero to infinity. 

Anyway, we should realize that such argument is far from being rigorous, because the diffusion between terraces 
must be understood as an effective one, since steps capture atoms. In this respect, it is important to point out that 
Eq. (0) —and so the current j— is the result of a spatiotemporal average and therefore jsb may appear even if no 
interlayer diffusion is allowed by the microscopic dynamics. Indeed, more microscopic approaches based on the Master 
equation |62| confirm that such term is present even if thermal detachment is forbidden. 

At small slopes we have 

Jsb « F £ 4 D V(m 2 ) = AV(m 2 ) (36) 

It is noteworthy that A sw K nuc : this is not surprising because [A] = [AT] and since F and £e> are the only parameters 
entering the model, there is no other way to obtain a quantity which is dimensionally correct. The current ( |3q ) 
appears in the study of the conserved Kardar-Parisi-Zhang equation |7q| ; in the more specific context of MBE, it was 
introduced by Villain p9[ . 

We conclude by observing that a rigorous derivation of the surface current should give all the terms we have 
introduced: the ES current, which depends on the slope to; the symmetry-breaking term, which depends also on the 
curvature V • to; the Mullins-like current, which depends on higher order derivatives (V 2 to). Nonetheless, at the 
moment a systematic derivation is lacking. 

E. Experimental results 

In this paragraph we want to present and discuss some experimental results concerning unstable MBE growth on 
a high symmetry surface, which are thought to be due to step-edge barriers. Let us start by describing the main 
features of the ES instability on a singular surface. 



After deposition of a certain thickness of material, which depends on the material itself and on the growth conditions, 
a three dimensional mound structure sets in. These mounds can be characterized by a typical size L and a typical slope 
too: it is often the case that L increases with time via a coarsening process, where bigger mounds eat neighbouring 
smaller ones. Experimental data for L(t) are plotted in a log-log scale in order to obtain the so-called coarsening 
exponent n: L(t) ~ t". 

In Tab. we summarize some experimental data, concerning metals (Fe, Cu and Rh) and semiconductors (GaAs 
and Ge). For any experimental system we give: the growth temperature T in Kelvin; the coarsening exponent n 
with its error (when indicated); the range of sizes L m i n -=- L max of the mounds over which n is calculated or the 
typical values of L which are displayed; the slope of the mounds; the range of thicknesses N corresponding to the 
sizes given in the L column or the maximal thickness reached in the articles referenced here; the intensity of the flux; 
the experimental techniques. 

From the third column we see that n varies between 0.16 ± 0.04 and 0.56 and that three values out of seven are in 
the range 0.23/0.26. It is not always possible to know on which range of values of L the exponent refers to; if it is 
possible, such range (Lmin -r- L max ) is indicated in column "L". It is therefore seen that the ratio L max / ' L m i n varies 
of a factor from two J7JJ] up to ten |[l[ . 

It is important to point out that for all metals a mound structure already appears after the deposition of a very 
few layers, or even in the submonolayer regime. The case of semiconductors is different: for example let us consider 
the epitaxial growth of germanium ]50| |. Here —as well as for silicon— the main problem is that the system undergoes 
a 'crystalline/amorphous growth' phase transition at a critical thickness h cr which strongly depends on temperature: 
this implies for example that it is difficult to track the ES instability at different thicknesses and it may be necessary 
to increase T in order to obtain mounds of larger size. In the case of Ge, the size of the first mounds increases with 
T . Their typical size, much larger than in most of metals, indicates a weaker ES barrier. 

A second point we want to stress regards the slope of mounds: Is it constant or does it increase with time? 
Experimentally, the question puts additional problems for two reasons: slope may increase because the 'constant 
slope regime' has not yet been attained, or slope may look to be constant in the range L m i„ -=- L max because the 
increase rate is too slow. These possibilities should be taken into account for Refs. |79| and |69], because in both cases 
larger and constant slopes are found in the same |77| , [78| or similar pi systems, respectively^ 

We finally observe that large slopes, corresponding to well defined orientations, are found only for metals, while 
unstable growth in semiconductors seems to be characterized by fairly small slopes (< 3°). 

F. Continuum theory of the instability 



In chapter IV D we have discussed the surface current j which should enter in the evolution equation for the surface 
profile. Here we will concentrate on the few models for which some well established results exist and we will discuss 
how 'robust' such results are with respect to modifications in the model. 

1. One dimensional models 

We have said that j is generally made up of three terms: 

3 = 3es + 3m + Jsb (37) 

Concerning the ES current there are two main classes, according to the existence or not of finite zeros in Jes- Anyhow 
JEsim) can always be derived from an effective potential energy U(m): 

r /* 

3Es(rn) = --— / dxU(m(x)) with [/'(to) = -j E s{™>) (38) 

dm J 

The Mullins-like current has the expression Jm = Km" (x) at sufficiently small slopes, which can also be written: 

]M = -JL(dx^(m') 2 (39) 



Clearly we must take care in comparing experimental results obtained at different temperatures (Ref. |69| at room temperature 
and Ref. H at T ~ 150° C). 



Nevertheless there is no basic reason to use the small-m expansion for jm if the surface develops an instability with 
large slope regions. Starting from Eq. ( P3[ ) it is easily found that a more general expression is: jm — ~fd x {cr(m)m'], 



where a{m) = [a(m)/\/l + c 2 m 2 + a'(m)\/l + c 2 m 2 /c 2 m] is an even function of m. In this case jm is no more 
derivable from a free energy: some considerations will be done at this regard in the following. 

Finally, the term jsb = d x G(m 2 ) can not be derived from a free energy either. This is not true for any symmetry 
breaking terms, but it is necessary to pass to a higher order term |82]. For example: 

Jsb = AcU(m') 2 ] = -/- / dxTss (40) 

dm J 

withes = (A/3)(W) 3 - 

If we neglect for the moment all the nonpotential terms, we are left with the expression: 

j=jEs(m)+Km"(x) (41) 

where Jes may have (model I) or not (model II) zeros at finite m. 

Before starting to study dynamics, it is important to consider stationary configurations of Eq. (pf), corresponding 
to a vanishing current: j = 0. This equation is immediately recognized as the equation of motion of a particle of 
mass K moving in the potential V(m) = —U(m). To be definite let us consider the following expressions for the ES 
current: 

Jes = am[l — m /m ] model I (42) 

JES = [1 + mHlV m0dd n (43) 

Model II reduces for 7 = 1 to the current introduced by Johnson et al. |33[ in 2+1 dimensions (Eq. (jig)) and 
by Hunt et al. (p3| in 1+1 dimensions. All microscopic models yield 7=1. Nonetheless it is useful to study the 
generalization to 7 > 1 p4j. The corresponding expressions for V(m) are: 



Vim) = —m tttti model I (44) 
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V(m) = ( — T ) ln(l + m z t D ) model II, 7 = 1 (45) 
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model II, 7 > 1 (46) 



In Fig. [[2] we plot these functions. 

In our mechanical analogy m plays the role of space, x of time and stationary configurations correspond to oscillatory 
motions around the equilibrium position m = 0. Such motions have different behaviours in the three cases: the 
important feature is how the period of the oscillations (i.e. the wavelength A of the stationary configuration) depends 
on the amplitude (i.e. the maximal value m* of the slope). In the harmonic regime [V(m) = am 2 /2] the period does 
not depend on the amplitude: this corresponds to the linear regime of Eq. (Q) , with a well defined wavelength and an 
'undefined' amplitude (depending on the real time, in the dynamical problem). When the amplitude is large enough 
we enter in the nonlinear regime and the question is: Does A increase or decrease with m*l For both models I and 
II, as well as for the current given in (p5|), A increases with m* because the 'force' jes decreases with respect to the 
harmonic regime, i.e. Jes/™ decreases with m. This condition can be found by writing: 



777 * / J^TIl * 

|j*| = |i*s| =* K—*j ES (m*) =* A«J-— - (47) 

A 2 y 3Es{m*) 

This means that stationary solutions of arbitrarily large wavelength exist and they correspond to increasing values 
of m*: this common feature is one of the necessary ingredients to have a coarsening phenomenon. However, important 
differences exist between the models: in model I, A — ► 00 when m* — > m^ , while in models II, A — > 00 when m* 
also diverges. In model I it is possible to define a stationary solution which corresponds to the boundary conditions: 
m(x) — ► ±to when x — * ±00. This solution is called 'domain wall' or 'kink' because m(x) is always constant (and 
equal to ±too) except in a narrow region around x = 0. The width of this region is 

Ldw = ] m W\^ x, ~ !4Si 




which (up to a numerical factor of order unity) is nothing but the lower critical wavelength A*: The surface is 
dynamically stable at smaller scales. 

In the limit to* — > m^ stationary configurations are made up of domains of increasing size L where the slope is 
alternately equal to ±too, separated by domain walls. This picture breaks down for model II because now m* — ► oo 
and it is no more possible to define domains of constant slope. Nonetheless a major difference exists between the cases 
7=1 and 7 > 1: for 7 = 1 there is no stationary solution corresponding to the boundary conditions m(x) — ► ±00 
when x — > ±00, while this solution does exist if 7 > 1. This also implies that for 7 = 1 (the physically relevant case) 
the maximal velocity in m = diverges when to* ~ * 00: in other words stationary solutions are characterized by a 
curvature which diverges in the minima and maxima of the profile (i.e. where m{x) = dz/dx = 0). 

Stationary solutions are important for the dynamics for the following reasons: coarsening means that these sta- 
tionary configurations are stable with respect to amplitude fluctuations, but unstable with respect to wavelength 
fluctuations. The evolution of the system is such that the surface profile passes through stationary configurations of 
increasing wavelength. 

Let us now come back to the expression (|4l|). Since our one dimensional current j is derivable from the effective 
free energy: 

T = I dx 



— (m') 2 + U(m) 



(49) 



the evolution equation for the order parameter m(x) is given by Eq. (10): 



d t m(x, t) = d 2 x ( |£ J + d x (noise) (50) 

For model I we exactly obtain the so-called Cahn-Hilliard equation [p5[ , which has been extensively studied. First 
we give the results and afterwards some qualitative explanations: 

(51) 
(52) 

The best way to explain these results is to recall the kink picture. A system with only one kink corresponds to a 
surface profile with only one mound: m(x) = z'(x) — > ±too when x — > =Foo. This stationary profile is stable, but if 
we put more kinks the profile is no more stable because kinks interact through the tails of their profiles. The shape 
of the tail can be easily determined linearizing the equation j ' = around mo: m(x) = rriQ — e(x). We obtain: 

JEs(m) + Km"(x) = -j' ES (m )e(x) - Ke"(x) = (53) 
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shot noise ^ Ref. 85 



The function e(x) has therefore an exponential behaviour: e(x) — eo exp(— kx) with k = \/\JEs( m o)\/K = y/2a/K. 
The inverse of k can be recognized (up to a constant) as the width of the domain wall: L& w = 1/k. 

It is therefore reasonable to expect that kink interaction and kink velocities (in the absence of noise) decrease 
exponentially with the inter kink distance L(t). Coarsening means that kinks feel an effective attractive interaction: if 
a kink has a velocity v ~ exp(— kL) where L is the distance with its nearest kink, then the time t(L) necessary for 
the annihilation of a pair of neighbouring kinks is t(L) ~ Lexp(ftL), which gives: L(t) ~ L^lni. 

The effect of noise is more difficult to take into account, because m(x, t) is a conserved order parameter: 
dt J dxm{x,t) = 0. On one side noise would make kinks move independently through a sort of random walk; on 
the other side the constraint of conservation couples the motion of the different kinks. In the absence of conservation 
we would simply have L(t) ~ t ' . In fact, in a random walk the time necessary to travel a distance L is t ~ L 2 . If 
m{x) is conserved the constraint slows down the kink motion and we pass from t x l 2 to t 1 ^ 3 . 

Now let us comment on the case a(m) ^ constant. If a depends on the slope m the current reads: 

j = Td x (a(m)m')+JE S (m) (54) 

What about stationary configurations? We can define the function M(m) = L dua(u), so that: 

j = TM"(x)+j ES (m(M)) (55) 

where M{m) is an odd function which always increases with to, since M'(to) = ct(to) > 0. We have seen that 



M 



JEsjm) m 
m ' M 



(56) 



must be a decreasing function of to, in order to have coarsening. The first ratio on the right hand side is a decreasing 
function of to, because we suppose that coarsening does exist if a(m) is constant (i.e. if M = to). On the other side, 
if ct(to) decreases with to, the ratio (m/M) increases because M/m is the average value of a(u) in the interval (0, to). 
According to this, the right hand side of Eq. (pq) is the product of a decreasing function and of an increasing function 
of to and therefore no general conclusion can be drawn from it, if a specific form for a(m) [and therefore for <t(to)] is 
not assumed. 



2. Two-dimensional models 



In chapter IV B we already remarked that a continuum description of crystal growth in 2+1 dimensions differs from 
a standard phase separation process even if the current can be derivable from an effective free energy. Let us discuss 
this point. 



The generalization of Eq. (41) is: 



J = JEs(m) + KV 2 m 



(57) 



This equation is almost always derivable from a free energy (it is indeed so for Eqs. (|18- 22 )) but not for any form of 
jESi m opposition to the one dimensional case. The reason is that 
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while Jes can be written as 
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22) and similar ones have the general form: 
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where the first one is valid for in-plane symmetry and the second for a square symmetry. In both cases the relation 
( p0| ) is fulfilled, as well as for their generalization (jn\j, see below. For a three- fold symmetry, Siegert |68| has suggested 
the following modification of Eq. (|22|): 



{Jes)x = a(l + 2m x )[m x (l - m x ) - m 2 y ] 
(JEs)y = am y {\ - 2m x - 2m 2 ) 



(62) 



Again, this equation fulfills the condition (pCT). The conclusion of this part is that —in practice— the passage from 
1+1 to 2+1 dimensions does not put additional problems for the derivation of j from a free energy. 

We now state the two major features which are introduced in 2+1 dimensions: i) The constraint V A to = 0, and 
ii) The impossibility for the time evolution of to to be governed by the Cahn-Hilliard equation. It is important to 
point out their common origin: the fact that the order parameter m derives from the local height z(x,t) [to = Vz =J> 
V A m = 0] and therefore the fact that its evolution equation derives from Eq. (0). So, we have 



d t m = -V(V ■ j) 



(63) 



and not 



d t m 



-v 2 J 



(64) 



Conversely, the phase separation process for a conserved order parameter is governed by Eq. (64), in order to implement 
the conservation law 



0, 



I dxrrii = — / dx\7 2 ji 



(65) 



which vanishes for the divergence theorem. 

Siegert and Plischke BftpSLpTJ have solved numerically Eq. (In) with j = jes + KV 2 m, where jes is given by 
Eq. ( pL9| ) [continuous symmetry in the plane] or Eq. ( |20| ) [four fold symmetry in the plane], and in the presence of 
noise. 

For a 'true' phase separation process, the two currents give different results for the coarsening law L(t) ~ t n (see 
the review p8|): n = 1/4 if the potential Uirn) has a continuous infinity of minima [in |m| = 1/a, for Eq. (|l9|)] and 
n = 1/3 if Ujfn) has a finite number of minima [in m = (±l,±l)/a, for Eq. (EQ)]. So, the result by Siegert and 
Plischke for the surface growth models (Eqs. ( |19| - p0|) ) is surprising: n ~ 1/4 in both cases |67J| . 

A deeper comprehension of the difference between the surface growth evolution and a 'real' phase separation process 
can be gained by comparing the different pictures of Fig. |l3|. 

They refer, respectively, to the following models: 
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where: 
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We stress that (b) and (c) are both growth models, but with different symmetries of the order parameter while (a) 
and (b) have the same symmetry for m, but the operators 'gradient' and 'divergence' are interchanged. 

A point of basic importance is that in surface growth a domain wall must be straight, because it corresponds to 
the intersection of two planes. 

In this respect Siegert, Plischke and Zia B9| have also proven that domain wall relaxation is much faster than 
coarsening, so that domain walls look straight on the time scale of coarsening. 

More precisely, if toi and fhi are any two minima of T a m or Ti SO : mi,2 = \m\ (cos 0i^, sin #1,2), then their intersection 
takes place along a line forming an angle 6 — (6\+ 62) /2 with the horizontal axis. 

This means that in principle model (b) allows for only four distinct kinds of domain wall, forming respectively 
angles 9 = 0,±7r/4,7r/2 with the x-axis, but domain walls parallel to the x,y axes (9 — 0, n/2) cost less energy 
(because one of the two components of the order parameter keeps constant) and therefore dominate the others. On 
the other hand, while the domain walls parallel to the Cartesian axes can intersect, giving rise to a pyramid, the 
other two walls cannot intersect. The constraint VAm = requires that each 'diagonal' domain wall (9 = ±7r/4) —if 
present— terminates at the same ending point as a horizontal and a vertical domain wall. In Fig. n3b all the possible 
intersections are visible. 

In the isotropic growth model (Fig. |l3|c) the energy cost of a domain wall does not depend on its orientation, but 
the constraint deriving from m = Vz implies a relation between the values of m in neighbouring domains and the 
direction of the separating domain wall. The consequence is a periodic surface profile which is made of a network of 
square-based or hexagonal-based pyramids. Only in the former case the up-down symmetry is fulfilled. The statistics 
of the orientations of the different facets /domains indeed show |x| that they are not randomly distributed, but 
four orientations to, = (cos #j, sin 64), with 9i + i = 9i + ir/2 are more likely to take place. We speculate that this 
phenomenon would be even more evident in the presence of noise (as in Fig. |l3jb). 

In a very recent paper | Bj| , Martin Siegert has proposed a more complex scenario for coarsening in 2+1 dimensions. 
He stresses the importance of having two classes of domain walls according to the values of 9 (see above) : parallel to 
the axes (9 — 0, 7r/2) and diagonal ones (9 = ±7r/4). This fact should prevent from the applicability of any theoretical 



approach based on the existence of a single length scale, because the densities of the two different classes of kinks 
(po = p{6 = 0, 7r/2) and p\ = p{9 = ±7r/4)) have different time dependences. Even if diagonal domain walls are 
more energy-expensive, they play a basic role in the dynamics because the surface would not coarsen in absence of 
them p9 , Bj |, 

The two relevant length scales are R = 1/po and D = 1/pv The latter one has a fast increase: D(t) ~ i 1 / 3 , while 
the former one increases more slowly, with an exponent close to 1/4. 

The previous picture can be completed with an expression for the ES current which is more general than Eqs. (pl|,p2|): 



(JEs)x=Tnx{l-m 2 x -bm 2 y ) (JEs)y = m y (l - m 2 y - bm 2 x ) (71) 

Eqs. (|2l]j2^ ) are recovered respectively for b — 1 and 6 = 0. The parameter b (whose relevant range of variability for 
a quadratic symmetry is —1 < b < 1) gives the dependence of each component (JEs)x,y of the current on the relative 
perpendicular component of the slope m v>x . Such dependence is mainly due to edge diffusion, whose importance 
has been experimentally shown in metal epitaxy by Jorritsma et al. ]7q |. The picture given above is applicable for 
b > —3/4, when diagonal domain walls are suppressed by energetics. Conversely for — 1 < b < —3/4 they are favored 
over horizontal/vertical domain walls. Since coarsening requires the nucleation of roof tops, for b = —3/4 this process 
does not cost anything and a larger coarsening exponent (n = 1/3) is found pl| . 

The same result should be valid for a triangular symmetry, relevant for (111) surfaces of fee crystals, where only 
one type of domain walls is present. This prediction agrees with the experimental results by Tsui et al. [ pl| on the 
growth of Rh(l 11). 

We conclude this part with some considerations on different continuous growth models. 

3. Symmetry-breaking terms 



In 1 + 1 dimensions it has been recently shown 92 that a current of the form jsb — d x G(m 2 ) does not change the 
coarsening laws, at least for the model I. Let us discuss briefly this point. 

Kawasaki and Ohta [p5| studied the dynamical evolution of model I in one dimension by reformulating the problem 
of phase separation for the order parameter m as a problem of nonlinear dynamics of kinks. A kink represents a 
(zero dimensional) domain wall between regions where m takes different constant values, corresponding to the zeros 
of the ES current (= ±mo for the current (E2)). The kink profiles M±(x) are the solutions of the equation j = 
with M + (±oo) = ±mo for the positive kink, and M_(±oo) = +mo for the negative one. Because of the up-down 
symmetry, M^{x) = — M + (x) and it is easily found that 



M±{x) = ±m tanh(/c x/2) with k = \JlajK (72) 

Because of kink interaction, kink dynamics has a deterministic component governed by the minimization of the 
pseudo surface free-energy, and a random component due to shot-noise. 

Since the kink picture does not depend on the derivability of the surface current from any free-energy, it has been 
applied 193] to the case of symmetry breaking induced by jsb — ^d x (m 2 ). We can summarize the main results as 
follows: (I) The functional form of the kink profiles does not change, (ii) It is no more true that M_(x) = — M + (x). 
In fact now: 

M±(x) = ±wo tanh(K±a;/2) (73) 

with k + > K- (positive kinks are narrower than negative kinks) and k + k_ = Kq (the product of their widths is 
constant), (iii) The total effective interaction between kinks is not qualitatively modified, (iv) The coarsening laws 
are the same, but the crossover time between the logarithmic one (at short times) and the power-low one (at longer 
times) may be significantly increased, (v) Positive kinks may be so narrow that their width is comparable to the 
lattice constant. In this case the positive kink should be replaced by an angular point, with a discontinuity of the 
surface slope. 

Concerning the symmetry-breaking in 2+1 dimensions, the only study of a surface growth model in the presence 
of such term is a numerical analysis due to Stroscio et al. Q, who do not use the Mullins term: jm = KV 2 m, but a 
higher order one: j — i^V 2 (V 2 m). Their numerical integration gives a coarsening exponent n — 0.18 ± 0.02 which is 
told to be insensitive to the presence or the absence of jsb- 



Jf.. Constantly increasing slope 

At the moment there is no rigorous study —not even in 1+1 dimensions— for a model where the ES current has not 
zeros at finite slope. Hunt et al. J83J have studied numerically model II with 7=1 (see Eq. (|43|)) in 1+1 dimensions, 
and found n ss 0.22 in the presence of a weak shot noise: therefore, a result well different from the values obtained 
for model I: n = 1/3 in presence of noise and n = (logarithmic coarsening) in absence of noise. The qualitative 
argument based on noise effects and given in App. |C], if applied to the case 7=1 gives p3| , |94| n = 1/5 in 1+1 
dimensions. 

The kink picture is manifestly not applicable for model II, since the current has no zeros at finite m and therefore 
regions of constant slope cannot exist. Even more important, the equation j = has no solution with M(±oo) = ±00 
as boundary condition: this means that periodic stationary solutions do not converge to a self-similar limit. 

Preliminary results |84[| on models II (with 7=1 and 7 > 1) in 1+1 dimensions show that steepening makes 
coarsening faster in absence of noise and slower in presence of it. The reason of the deterministic "acceleration" is the 
following: even if a kink picture is not suitable to describe the interface, at least for 7 > 1 the kink can be replaced 
by the wave function of the ground state of a potential well, whose profile depends on the exact form of the unstable 
current. Since the kink profile has an exponential tail while the wave function decays algebraically for models II, 
potential wells interact more strongly than kinks, and as a consequence of this coarsening is faster. The reason of the 
noisy "slowing down" has not a simple explanation, but it is probably due to the competition between steepening and 
coarsening (see also the next paragraph on Monte Carlo simulations). 

5. The Zeno model and the need of a nonlocal evolution equation 

The 'Zeno model' [p8| deserves some attention, it is a one-dimensional model for a high-symmetry surface which 
evolves through instantaneous attachment of adatoms to preexisting steps and creation of new steps via nucleation 
of new terraces (a process which is intrinsically random). In this model only fluctuations in the nucleation process 
contribute to the Mullins-like term (K = K nuc ); since nucleation is unlikely on small terraces, the K-teim is expected 
to decrease when the slope increases. This explains the strong up-down asymmetry displayed by the model (see 
Fig. [l4] ) : nucleation is not effective to counterbalance the ES current in a region of high slope and the SB term is 
destabilizing in a region of positive curvature. In the Zeno model the asymmetry has a striking consequence: the 
regions of positive curvature are replaced by angular points. Because of the discontinuity of the slope m in an angular 
point, the SB term in the current (jsb — d x G{m 2 )) gives a divergent contribution to the velocity of the interface, 
if a local evolution equation is assumed: dtz — — d x jsB-i\ The solution |38| is to replace dtz(x,t) = — d x j with the 
following nonlocal evolution equation: 

d t z(x, t) = -d x I dx' X (x - x')j{x', t) (74) 

where j(x' , t) is the local current evaluated in the point x' and the kernel x( x ~ x ') nas the form: x( x ) = ex P( — M/^o)- 
The Zeno model has a second major feature: the existence of an upper critical wavelength X^, up at which coarsening 
stops. The reason is that stationary configurations (d t z = 0) exist only for A < Aj!" p . At larger scales mounds are 
unstable even for height fluctuations, and therefore deeper and deeper crevasses form in between. 

G. Monte Carlo simulations 

This final paragraph is devoted to kinetic Monte Carlo (MC) simulations of a singular surface, in the presence of 
step-edge barriers. We will limit ourselves to some representative works in 2+1 dimensions. 

An important preliminary statement concerns the implementation of a constant slope in a simulation where crystal 
structure effects are difficult to take into account. The adopted solutions may differ: In a simple cubic SOS model, 
the simplest way is to allow the freshly landed adatom to hop to a neighbouring lower site (see Refs. |87]|)5(]). For 
more realistic cubic lattices (bcc/fcc) the typical solution is a downward funneling mechanism |9(],|97| according to 
which a higher coordination site has a larger 'capture area'. 



The Mullins term gives an even more divergent contribution (because of the higher derivative) , but nucleation (and therefore 
K nuc ) is completely negligible in a large slope region. 



Smilauer and Vvedensky |95[ obtain different time dependences for the slope by varying the intensity of the down- 
hill current. Two limiting cases are noteworthy: constant slope and absence of the down-hill current. In the former 
case, the relevant model should give a coarsening exponent n ~ 1/4. Simulations give a value varying between 0.21 
and 0.26. The latter case (absence of a down-hill current) is interesting because slope is certainly expected not to 
saturate (slope~ t ), and a naive application of the condition \jes\ ~ \Jm\, with \]es\ ~ 1/I TO I would give d' = n. 
Simulations p5|, in one case out of two, give a result which agrees: n = 0.19,$' = 0.21. An important feature 
which emerges from their simulations is that size growth is slowed down by an existing slope increase, that is to say 
steepening and coarsening of mounds are two competitive processes. This fact is supported by the qualitative analyses 
of Refs. |B3,M, discussed in App. O: the exponent n decreases, if the slope passes from being constant to increase 
with the mound size. 

Siegert and Plischke |$7J consider, in addition to isotropic ES barriers, anisotropic ones for which (JEs)y = and 
the resulting mound structure has a one dimensional profile (m y — 0). Nonetheless the anisotropic 2+1 dimensional 
model differs from the same model in one dimension (at least if noise is present) because the latter does not order at 
finite temperature. So, while the isotropic case gives n = 0.26 (very close to 1/4) the anisotropic case does not give 
n = 1/3 —as expected for the noisy one dimensional model— but a much smaller value: n ~ 0.18. 

Finally, we want to comment on the works by Thurmer et al. |97j and by Amar and Family |9(| on bcc/fcc(100) 
lattices. The models seem to be fairly similar, but the results differ substantially: Thurmer et al. find constant 
slopes —corresponding to the facets (012)— and a coarsening exponent n « 0.24/0.26. On the other side Amar and 
Family find a much broad range of variability for n (n w 0.16/0.25 for weak barriers and n ~ 1/3 for strong ones) and 
the slope has various behaviors. A possible explanation is that even minimal differences in the model may influence 
step-edge diffusion, and -after Refs. |7g,|9l]]- the coarsening behaviour.^] 

V. KINETIC INSTABILITIES OF VICINAL SURFACES 
A. Introduction 

A vicinal surface is a surface which is cut along an orientation close to a high symmetry one. It is made up of low 
index terraces (e.g. (001) or (111)) separated by atomic steps. Steps are unavoidably present in any real surface: for 
example, because the surface is always slightly miscut, but also because steps may be thermally activated or because 
a screw dislocation ends up in a step at the crystal surface; finally, steps may be created during the growth through 
nucleation. The first factor is the only one to produce steps all of the same sign (ascending or descending ones), and 
the second factor is completely negligible at small temperatures. 

In the presence of an external flux, a step acts as a sink by capturing adatoms diffusing in the vicinity and therefore 
advancing. In average a straight step (as those which result from cutting the crystal) remains parallel to its original 
direction, while the steps due to screw dislocations give rise to a spiral growth. In this chapter, dislocations as well 
as thermally activated steps will be ignored. 

A growing vicinal surface can only be stable if three conditions are fulfilled, i) Diffusing adatoms stick to preexisting 
steps rather than nucleate new islands on the terraces, ii) Terraces keep the same width, iii) Steps remain straight. 
When one of these conditions is not fulfilled, it gives rise to a well defined instability, called respectively i) three- 
dimensional growth, ii) step-bunching and iii) step-meandering. 

Since the typical distance between nucleation centers on a high symmetry surface is the so called diffusion length Ip 
(see App. 0), we would be led to conclude that the condition for avoiding three-dimensional growth is simply I < £d, 
where t is the terrace width. Nonetheless nucleation is a stochastic process and therefore nucleation events take place, 
even if rarely, on a terrace smaller than (.£>. So, an important issue is the asymptotical stability of step-flow growth 
against nucleation: this question will be discussed at the end of the chapter, and till then nucleation will be ignored. 

Let us start by showing what the ES effect has to do with step-bunching and meandering instabilities. To make 
a clear distinction between the two phenomena, we will consider the following limiting cases: i) a one dimensional 
vicinal surface (in order to avoid the possibility of meandering) and ii) a single step (in order to avoid step-bunching) . 
i) Let us suppose that adatoms prefer to stick to the ascending step, as they do in the presence of an ES effect: this 
means that a terrace which is larger than its neighbours will reduce its width because the ascending step proceeds 
faster than the descending one. The opposite is true for a smaller terrace. The conclusion is that the ES effect 



We should also observe that in Ref. |97| fits are generally carried out at larger thicknesses that in Ref. |96| 



stabilizes the width of the terraces during growth. 9 Conversely, if the crystal surface evaporates, an ES effect means 
that an adatom will detach preferentially on the lower terrace and therefore a larger terrace will become even larger: 
a step-bunching instability takes place during evaporation. This scenario was predicted thirty years ago by Schwoebel 
and Shipsey |38| . 

ii) Now let us consider a single step. This makes sense only if evaporation is present, otherwise an infinite current of 
adatoms would stick to the step and it would move with infinite velocity. In the presence of desorption an adatom has 
a lifetime t, which means that it travels through a linear distance x s = y/Dr before evaporating: so, only adatoms 
landing within a distance x s from the step will contribute to the step advancement. If a train of steps is correctly 
considered (as in case i), the limit of vanishing desorption is still physical, because sinks at distance I do exist. 

The meandering instability is conceptually similar to the diffusion instabilities treated in chapter O: the growth 
front develops some protrusions which capture more atoms and therefore grow even more. The main difference is that 
also atoms coming from the 'inner side' of the front contribute to step motion, and the meandering instability can 
arise only if the inner contribution is too weak to compensate the outer contribution, in other words if an ES effect is 
present. However, Saito and Uwaha p£| | have shown that a meandering instability can take place even in the absence 
of an ES effect, if the step velocity exceeds a critical value. In fact, during the growth, in the reference system of 
the step the adatoms of the lower terrace have a higher component of the velocity in the direction perpendicular to 
the step and therefore the 'flux' of the incoming adatoms from below is greater than from above Fj However, the 
required flux is so high that the diffusion length Ip is very small and nucleation can no more be neglected. Finally, 
the drift of adatoms may be even i ncre ased and controlled via application of an external electric field, which produces 



the electromigration phenomenon | IOC ] 



We want to stress that the conditions for the meandering instability and those for step bunching are opposite: the 
former takes place if the surface grows, while the latter if the surface sublimates. Therefore, the consequence of points 
i) and ii) is that a two dimensional train of steps is generally expected to be unstable. 

The study of the dynamics of a vicinal surface will proceed as follows: linear stability analysis of a single step |54] 



and of a train of steps [101]; study of the w eakly nonlinear regime, close to th e threshold of instability, for a single 
step |102|] an d for a train of steps [103,104]; Monte Carlo simulations j99,105|; continuum description of the vicinal 



surface~]l04-107|. We will conclude with some remarks on three-dimensional growth and step-flow |105 10S] 



B. Single step: linear stability analysis 

Before starting we would like to point out the qualitatively different approach which is required by a vicinal surface, 
with respect to a high symmetry one. On a singular surface, steps are continuously created (through nucleation) and 
destroyed (through coalescence): at least in the case of stable growth, the lifetime of a step is given by the typical 
time necessary to fill one layer: tuh — l/(-Fo fl2 )- Since the continuum description is valid on time scales larger than 
ImLi steps do not enter 'directly' in such description. Conversely, steps have an infinite lifetime on a vicinal surface 
(if nucleation is neglected) because they all have the same sign (descending or ascending one). This means that a step 
motion picture is better than a 'surface current' picture to describe the dynamics of the surface. 

A second remark concerns the microscopic dynamics: the density p of adatoms on a terrace is governed by the 
diffusion equation in the quasi static approximation: 

DV 2 p - - + F = (75) 

T 

which is a linear equation. The nonlinear character of the step evolution derives from boundary conditions at steps. 
In the following, x will be the average direction of the step and y the in-plane perpendicular direction, so that 
the step may be described by a single-valued function £(x, t) with the lower terrace corresponding to y > £. The 
normal component to the step of the gradient of concentration (dp/dn) is proportional to the difference between the 
nonequilibrium (p) and equilibrium (p eq ) concentrations at the step itself: 



9 It is important to recall that impurities can pin the steps and therefore play a major role in step-bunching during growth [B8J . 
10 In this case the instability may take place even during evaporation; however a finite ES effect stabilizes |)9| the profile of 
the receding step. 



where the signs +/— correspond to the lower/upper terrace (i.e. to y = C±) an d the equilibrium concentration is 
given by: 

Pe q = p° eq (l + rn) (78) 

The second term depends on the curvature [J k of the step: 

d 2 C 



[1 + (9,C) 2 ] 3/2 



(79) 



and reflects the Gibbs-Thomson effect: close to a region of positive curvature the concentration is higher because of 
the higher rate of thermal detachment from the step. The coefficient T = il(3j depends on the surface area per atom 
fi and the ratio between the step stiffness 7 and the temperature T = l/(fcg/3). 

The existence of an ES barrier means that fc+ > fc_: if the barrier is perfectly reflecting (one-sided model), then 
k- = = (9„p)_; on the other side, if the step is in equilibrium with the lower terrace, then (p — p eq )+ = which 
corresponds to k + — 00. In the case of a single step, the boundary condition to infinity (y — ► ±00) is that the 
concentration goes to a constant value, which corresponds to the balancing between deposition and evaporation: 

p(x, y, t) — > poo = tF single step (80) 

Once the concentration profile is known as a function of the step profile, the step velocity v n may be determined 
via the relation: 

(A Ps )v n = D[{d nP )_ - {d nP )+\ (81) 

where Ap s is the difference of surface concentration between the solid phase and the 'adatom gas' phase. Since the 
latter is negligible with respect to the former: Ap s = I/O.. 
Finally, v n and ^(ar, t) are linked by a kinematical relation: 

V0 + dtC (82) 



>fi + aO s 



where vq is the 'drift' velocity, pertaining to a straight step. It is noteworthy that for the linear stability analysis, the 
formulas: k = —d 2 ( and v n — vq + dtC can be used. 

Let us now consider the simplest case: a single step in the one-sided model (i.e. an infinite ES barrier). We will 



follow the treatment given by Bena, Misbah and Valance [102 



The concentration profile and the 'drift' velocity w are given by: 

Po(y) = Poo - (poo - p" g ) exp(-y/x s ) vq = Qx s (F - F eq ) = ttx s AF (83) 

where F eq = p e „JT. Their interpretation is simple: only adatoms arriving within a distance x s from the step are able 
to stick to it before evaporating and therefore contribute to the step velocity. If thermal detachment is forbidden, vq is 
simply given by the number of atoms sticking to the step per unit time and unit length, i.e. by the term proportional 
to Fo in (p3). The term proportional to F eq takes into account the rate of detachment from the step. Finally, po(y) 
is the one-dimensional solution of Eq. (|75| ) 'interpolating' between p° in y = and poo at infinity. 

The linear stability analysis proceeds by considering a small deviation from the stationary concentration: 

P(x, y, t) = p (y) + pi(y) exp(iqx + ut) (84) 

and from the straight step profile: 

((x, t) — Ci exp(iqx + cut) (85) 



L In the following the curvature is understood to be positive if d^C < 0. 



and by solving the differential equation at the first order in pi, Ci- The existence of a nontrivial solution (pi = £j = 0) 
requires a dispersion relation between u) and q: 
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(86) 



with A g = -\/g 2 + 1/x 2 . It is indeed sufficient to expand the previous expression to order q 4 : 
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(87) 



The straight step is dynamically unstable if U)(q) is positive in some range of q- values. Since the q term is always 
negative, there will be an instability only if the coefficient of q 2 is positive, and this requires that AF is larger than 
a threshold value given by: 



(AF) C 



2fp° 9 



(88) 



However, this threshold is physically unimportant, because it is much smaller than the value F eq the flux must 
overcome in order to allow the crystal to grow instead to evaporate. In fact (AF) cr /F eq = 2(a/x s )(a(3j), where 
(a/x s ) is much smaller than one and (aftj) is of order unity. 

If AF > (AF) cr there is a band of unstable modes, ranging from q = to q = q* . The general expression for q* 
is fairly complicated, because it is derived from a cubic equation. Nonetheless, if we are very close to the threshold 
instability the approximation ( p7|) can be used and we find: 
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The dimensionless parameter e measures the distance from the threshold. For t«l we are in the so called weakly 
nonlinear regime, and its analysis will be done in the next paragraph. 

Formula maj clearly shows that stabilizing effects are entirely due to thermal detachment: In fact uj(q) > \/q if 
Peo = 0. Such effects derive from the fact that thermal detachment is encouraged in a region of positive curvature 
and discouraged if the curvature is negative: this proc ess part ly corresponds to an effective step-edge diffusion which 
resembles the Mullins-like term introduced in chapter IV D 2 . As we will see in the next paragraph, it is possible to 
give a more solid basis to this interpretation. 

However, thermal detachment is also at the origin of the negative term in the coefficient of the q 2 term (see Eq. (p7\j): 
in the absence of p® we would simply have: (u/flD) ~ (TFo/2)q 2 . This means that step detachment 'renormalizes' 
and lowers the coefficient (tFq/2), but with the basic contribution of evaporation: in fact the threshold value vanishes 
in the limit t — > oo. In other words, step detachment is only effective at short scales if it must 'use' surface diffusion, 
but becomes effective also at longer scales if it can use evaporation. 

Now let us discuss the more general case of finite ES barriers and interacting steps. In their pioneeri ng w ork, Bales 
and Zangwill |54] considered finite ES barriers for a train of in-phase steps; afterwards, Pimpinelli et al. [ 101 1 extended 
their study to arbitrary phases, showing the importance of this extension in the case of an evaporating surface. 

The main result of the treatment of finite ES barriers is that the qualitative picture given in the previous chapter 
is confirmed: a nonvanishing fc_ implies a larger threshold value for the flux. The condition AF > (AF) cr reads 
£ < £ c = 1/2 for an infinite ES barrier (see Eq. (89)), while in the opposite limit of very weak asymmetry (k + ~ fc_) 



we have [101 1 : 
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(90) 



Here and in the following we will suppose that d + = (fast attachment from the lower terrace) and therefore d- 
is nothing but the ES length £es- In the case of a single step it should be compared with x s (£es ^ x s meaning a 
strong ES effect). With these notations, the previous value of £ c (valid for a weak ES effect) simply reads: £ c = £es/2- 



C. Linear analysis for a train of steps 



Now let us consider the stability of a train of steps. If £ is the step-step distance, the case of a single step is 
recovered in the limit £ 3> x s . In fact, in this case there is a region between each pair of neighbouring steps where 
the concentration p of adatoms has the constant value p^ and therefore the concentration profile is the one given in 
Eq. (p3|). Since the case £ 3> x s has been studied in the previous paragraph, we consider the opposite limit: £ -C x s . 

Using the results of Ref. [ 101 1, the amplification rate can be written as: 



u(q, <fy = £7A.F[(cos 4> - 1) + t z cf/2] 
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(91) 



where <f>, the phase shift between neighbouring steps, plays the role of the wavevector q y in the direction perpendicular 
to the steps. 

Separating terms of different order in g, the previous formula reads: 



u>(q, 4>) = f2AF(cos <p-l] 



( £es 



Ves 



\ QAF ( ££ ES 



V 



Ves 



ilDtpl 



2(1 



COS( 



£ 



£ + £es 



SWtpU ■ g 4 



(92) 



The quantity w(0, <fi) corresponds to a rigid translation of each step and therefore it allows to study the step-bunching 
instability: 



oj(0,(p) = SlAF(cos </>-!) 
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(93) 



This quantity is always negative (and therefore the train of rigid steps is stable) if AF > 0, i.e. if the surface is 
growing. The opposite is true (and therefore the train of rigid steps is unstable) during evaporation. Furthermore 
w(0, 4>) is maximized by <\> = (in-phase motion) if AF > and by (f> = tv if AF < 0. This means that the single step 
approximation is acceptable in the former case, but it is not in the latter one. 

In the case of evaporation, the coefficient of the g 2 -term in (p3) is always negative and therefore the g 4 -term can 
be neglected: 



w(g,7r) = 2fl\AF\ 
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(94) 



The sum in square brackets reduces to [£ 2 /2 + T£] for a strong ES effect (£es 3> £) and to [£ ES /2 + ATx 2 /£) for a weak 
one (£es •C £)■ It may also be interesting to evaluate the width of the band of unstable modes, i.e. the value q* for 



which uj(q*,ir) = 0. It is easily found that q* = 2j\J£ 2 + 2T£ for £ ES > £ and q* = {£ ES /£x s )^ £/2T for £ ES < £. 
In the case of growth the most destabilizing mode corresponds to <fi = and the rate of increase 
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(95) 



is positive when the flux exceeds a threshold, just as in the single-step case. The value of the threshold is now 

(AF)cr ^Ml(^±lY (96) 



T V WES J 

which reads (AF) cr — 2Tp® jr£ for a strong ES effect, and (AF) cr — 2Tp Q £/t£ 2 es for a weak one. The former expres- 
sion is nothing but Eq. (88), where x s has been replaced by £: this is reasonable, because the adatoms contributing 
to the motion of a given steps are the ones landing within a distance x s for a single step, and within a distance £ for 
a train of steps. A similar comparison is not possible for a weak ES effect because in this case there is a more subtle 
interplay between £, £ E s and x s . Finally, in the absence of evaporation (r — > oo) and for a strong ES effect (£ E s ^> £), 
Eq. (p5|) reads: 



w(g,0) 

which is the same as Eq. (4) of Ref. |104[ . 
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(97) 



D. Single step: weakly nonlinear analysis 

The linear stability analysis is useful in two respects: because it allows to determine for which values of the different 
parameters step-flow growth is stable and because it allows to extract the time and space scales which are relevant 
for the subsequent nonlinear evolution. In this paragraph we will consider the dynamics of a single step, just above 
the threshold of the instability, i.e. for AF/(AF) C = 1 + 2e , with e< 1, where e was defined in J89J). This analysis 



was carried out by Bena, Misbah and Valance in Ref. [102|. The calculations are a bit lengthy but instructive and 
therefore they will be given in App. |y. Here we will just explain the main lines of reasoning. 

The first step corresponds to make x, y, (,{x, t) and t dimensionless, by rescaling lengths with x s and time with t, 
and to consider the concentration u with respect to its constant value p^ = Fqt at infinity, rather than the 'absolute' 
concentration: u = p — Fqt. This way, the diffusion equation simply reads: V 2 u — u = 0. In the same units, close to 
the threshold the dispersion relation is 



u> 



nY(2eq 2 - 3g 4 / 4 ) (98) 



where V = Tp° eq /x s . 

The previous relation clearly shows that the 'active' modes (i.e. the unstable ones) are of the order of q ~ s/e and 
u> is of the order of e 2 . In other words, the unstable structure has an initial wavelength of the order of A* = l/\/e and 
it takes a time t* — 1/e 2 to develop: so A* and t* are the good space and time units for the nonlinear analysis, and 
from x,t we will pass to X — y/ex, T — e 2 t. (Note that y is not rescaled, because the step profile is only a function 
of x and t.) 

The next step is to expand both the concentration field u(X, y, T) and the step profile C(^; T) in powers of e with 
the idea to solve the problem (differential equation + boundary conditions) recursively. Since £ must vanish in the 
limit e = 0, the term in e° in its expansion is not present and it is therefore useful to put £(X,T) = eH(X,T). At 
this point, the functions u and H can be written as: u = X) n =o Mne ™ an< ^ ^ ~ E n =o^nf"- At each order of the 
integration of the differential equation V 2 w — u = a new 'integration function' A n (X,T) appears, which should be 



determined via boundary conditions. What is found [ 102 1 is that A\ and Ai keep undetermined till the third order. 



At this order, the boundary conditions imply that Hq(X,T) fulfills the differential equation 

(nry^THo = -2d 2 x H - 1%h + (d x H Q ) 2 (99) 



This equation is called Kuramoto-Sivashinsky [ 109 1 (KS) equation and exhibits spatiotemporal chaos. Saito and 
Uwaha [B9[ have performed Monte Carlo simulations on a simple cubic (SC) lattice for different values of the parame- 
ters. In particular they have pointed out the importance of the crystalline anisotropy in the step stiffness 7: an effect 
which is not taken into account in the previous nonlinear analysis and whose study needs to get to the fifth order in 



the e-expansion [110 



If the step is oriented along a direction of maximal stiffness (for example the [10] orientation for the SC lattice) 
the anisotropy does not play an important role and above the threshold (AF) cr the step displays spatiotemporal 
chaos (Fig. lc of Ref. Q ) : grooves at average distance given by X u are constantly created (when locally the distance 
is larger than 2X U ) and destroyed (through collisions and annihilation). Conversely, if the step is oriented along a 
direction of minimal stiffness (the [11] for a SC lattice) this 'stabilizes' such orientation and a more regular pattern 
appears (Fig. 4c of Ref. p9|). In the limit of strong anisotropy a periodic structure emer ges. 



The previous scenario has been explained by the same authors in a subsequent paper |110|. In the presence of an 
anisotropic step stiffness, 7 reads 

7 = 7o[l + z/(l-cos40)], (100) 

where 9 is the angle between the step normal and the average growth direction y. Accordingly, the coefficient of the 
lower-order linear term {d\Ho) in Eq. ( p9| ) is modified by the factor [1 — 8ve 2 (dxHo) 2 ]. This implies that the effect 
of the anisotropy is vanishing at the threshold instability. Secondly, a marked difference exists between v < (the 
stiffness is maximal) and v > (the stiffness is minimal). In the former case the above factor is always positive and 
anisotropy effects are not relevant. In contrast, in the latter case (y > 0) the coefficient of (O^Hq) may even change 
sign in regions of high slope and therefore stabilize the minimal stiffness orientation. 
Let us now come back to Eq. (J99|) . Its linear analysis gives the dispersion relation: 

u(q) = nT(2q 2 - 3g 4 /4) (101) 



After having reintroduced the old variables (q —> q/^/e and ui — > w/e 2 ) it takes exactly the form given in Eq. (|9_£ 
So, the linear terms of Eq. rt93) could be 'inferred' by the dispersion relation (Bq). With regard to the nonlinear 



term in the KS equation, it is the simplest and lowest order nonlinear term which does not depend explicitly on Hq 
(because of translational invariance, the dynamics cannot depend on the absolute position of the step). The sign of 
the nonlinear term is irrelevant for the subsequent analysis, because it may be changed passing from Hq to — Hq. 

It may be useful to display explicitly e in the evolution equation for the interface. In or der t o avoid confusion let 
us come back to the initial variables (x, t, Q and —in terms of these— let us define: x' — y/8/3(x/x s ), h = C/x s and 
t' = (16f2r/3)i/r. The evolution equation for h(x',t') then reads: 



d v h = (l/2)(d x ,hf 
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— 1). When AF < (AF) cr , e is negative and the step 



where e (defined in Eq. (|89|)) has the form e 
is linearly stable. The quartic term is negligible and Eq. (102) takes the form of the deterministic Kardar-Parisi- 
Zhang equation: nonetheless, since no deter minis tic instability is present it is n eces sary to add a noise term 6(x' ',£') 
to its right hand side. Karma and Misbah [ 103 1 have therefore modified Eq. (102) to take into account stochastic 



fluctuations; Furthermore, to make it valid beyond the small range e<l, t he co efficients of the nonlinear (d x >h) 2 and 
quartic (d*,h) terms become dependent on the incoming flux. Finally, Eq. ( 102 ) is generalized in the following way: 
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(103) 



where A = AF/(AF) C , fj, = (2 + A)/3 and: 



(e(x' 1 ,t' 1 )9(x' 2 ,t' 2 )) = R6(x' 1 - x 2 )5(t[ - t' 2 ) with R = y/3/8(T/ 7 x 8 ) 



(104) 



This equation has been sol ved numerically by Karma and Misbah [103] and discussed in details in a recent paper 
by Pierre-Louis and Misbah fllOTfl . Stochastic fluctuations are important below the instability threshold (when the 
straight step is linearly stable) while deter min istic ones are important above the threshold. In a small range around 



e = both are relevant. This may be seen [107] by rescaling x' and t' in order to have the coefficients of the two linear 



terms of order unity. Afterwards, the rescaling of the amplitude h will fix the relative importance of the nonlinear 
term and the noise. If we put: 



we obtain: 



dfh 



x = x/a x t = t/ott h(x,t) = a.hh(x,t) 



-a 2 x a t 1 td 2 x h - a x a t 1 /J,d~h + a h a 2 x a t 1 -(d x h) 2 + 6 



where = (ahctt) 6 and therefore: 

{9(xi,ii)Q(x2,h)) = Ra^ 2 a x at 1 5(x 1 - x 2 )S(i 1 - i 2 ) 



(105) 
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The condition on the linear terms requires that a x — y/\e\//J, and at — e 2 //i. On the other hand ah may be 
determined by fixing the coefficient of the nonlinear term ((ah) n LOt- 2 a^ A = 1) or the amplitude of the noise 
(R(ah)n 2 ot x a~[ — 1). The range of values of e where both noise and instability are relevant is determined by 
the condition that (othJNL ~ (oihjn, i-e. |e| ~ R 2 ' 7 . 



E. Nonlinear analysis for a train of steps 



Th e first study on the dynamics of a train of steps in the nonlinear regime is due to Pierre-Louis and Mis- 
bah [ 10G , 107 , 111 |, who also take into account the elastic repulsion between steps. They perform a coarse-graining 
procedure to pass from the discrete evolution equations for the steps (variables C, m (x, t)) to its continuum version. Here, 
we will li mit o urselves to observe that the resulting equation has the form of an anisotropic Kuramoto-Sivashinsky 
equation |112| which includes a propagative term. The releva nt fea ture is that a numerical study |107| shows a mor- 
phology resembling the late-time evolution of a vicinal surface 105] in the absence of evaporation, but in the presence 
of nucleation (see section VCf ) . 

In the treatment of the linear dynamics of a train of steps, evaporation is not negligible in the sense that x s = V Dt 
may be large (larger than the step-step distance), but it is smaller than the wavelength of the emerging patt ern. Thi s 
implies that the limit x s — > oo requires a different study, which has been performed by O. Pierre-Louis et al. [ 104 . 106 1 . 



The analysis of the most unstable mode (the in-phase movement of the steps) close to the threshold of the instability 
(which now corresponds to a vanishing flux) gives the following evolution equation for the step £(x,t): 



d t C, = -d x 
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(108) 



where a = eD/2 and $ = DT£. As usual , e = £IFq£ 2 /D measures the distance from the threshold: since desorption is 



absent, it is equal to the Peclet number 113 and represents the ratio of the 'step velocity' (~ Folfl) to the diffusion 
velocit y of the adatoms (~ D/£).q 

Eq. ( |lOS| ) —which is valid for the one-sided model (infinite ES barriers)— has the form of a conservation law: 
d t ( = —d x J. In fact the evolution of each step is determined by the adatoms falling and diffusing on the front terrace 
and by rearrangement of the step profile through detachment/reattachment of adatoms from the step itself. The effect 
of these processes is that Eq. (108) looks like the evolution of a one dimensional high-symmetry surface. Nevertheless, 



major differences must be stressed: the dynamics of a high-symmetry surface are only determined (in the absence 
of desorption) by surface diffusion, which would correspond in the presen t co ntext to step-edge diffusion, a process 
which is indeed forbidden in the abo ve m odel! Secondly, the form of Eq. (108) would be completely different if only 
one step was considered (see chapter VD ) and the limit x s — > oo would not be allowed. 
It is now interesting to comment on the expression for the current J: 
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(109) 



where we have introduced the local slope of the step m = d x £ in order to make the analogy with a one dimensional 
surface clearer. 

The term proportional to a plays the role of the unstable current due to the ES effect, while the second term 
should represent a current a la Mullins, but not in its 'standard' linear form (~ to"): the nonlinearity inside square 
brackets is nothing but the nonlinear chemical potential, if the surface tension a does not depend on the orientation 
(see Eq. (p3|)) . Conversely the prefactor (3/(1 + m?) represents a sort of slope-dependent conductivity which is due 
—in the present context— to 'kinetic' interaction between steps.Fj 

The latter nonlinearity plays a fundamental role, as easily seen by the study of the stationary configurations J = 0. 
In fact, the reduction of the mobility exactly cancels the same factor in the a-term so that the unstable current never 
decreases. Introducing the variable M = m/y/1 + to 2 , the equation determining stationary configurations takes the 
F(M) = —aM/y/1 — M 2 . Since \F(M)\/M is an increasing function of the slope, the wavelength A 



form (3M"{x 

of the profile is expected to decrease with th e ma ximal slope (and therefore with the amplitude ^4). This analysis is 

confirmed by the exact evaluation of A and A |104|. As a consequence of this, the system does not coarsen and the step 

develops an unstable pattern whose wavelength keeps constant and equal to the most unstable mode A u = 27rv/2/3/a. 

The st ep width increases as \fi without boun d, as can be easily checked by replacing the ansatz ((x, t) = A(t)g( x) in 
(108) [ 104, 114 1 . A numerical solution of Eq. (108) and full lattice gas simulations confirm the previous pict ure [ 104 1 . 
Finally, an in-phase meandering of (1,1,17) Cu surfaces has been recently observed by Schwenger et al. |115| ] at a 
sufficiently low temperature. 

Different and more phenomenological approaches ]105[ ] do not provide the nonlinear prefactor /3/(l+m 2 ) in Eq. ( |109| ) 
and therefore coarsening along the step should be allowed. However, the direct integrati on of the continuum equation 
does not seem to display coarsening, which is indeed found in Monte Carlo simulations flO 



F. Continuum description and step flow 

In this chapter we address the following question: is it possible to link the step-motion picture for a vicinal surface, 
with a continuum description based on the surface current? In the step-motion picture we have a train of steps 



12 It can also be read as the adatom density per lattice site, due to the incoming flux. 

13 If £ is the step-step distance in the direction perpendicular to straight steps, the shortest path between steps with slope m 
is £/(l + m 2 ). In a certain sense the adatom mobility must be reduced by the same factor in order to keep the travel time 
between neighbouring steps constant. 



whose profiles are given by £„(x,t), where x is the average direction of the steps. Let y be the in-plane direction 
perpendicular to x and £ the average distance between steps. If the steps are straight and equally spaced, then £ n = 
and the height z of the surface is simply given by: z(x, y, t) = —y fi- 
lm the general case of a step profile depending both on x and n we should pass to a continuum function £(x, y, t). 
This is allowed only if ( n (x,t) changes by a small amount from one step to the next one: this is clearly true for an 
in-phase step-motion, but for an out-of-phase step-flow —as the one which takes place during evaporation because of 
step-bunching— it is surely not true. 

For in-phase motion, it is easily found that: 

z(x,y,t) = [((x,t)-y}/£ (110) 

and 

d t z = d t c/e , m = vz = (i/£)(d x (, -i) (in) 

Let us now suppose that evaporation is negligible so that the conservation equation d t z = — V • j holds. Since j 



depends on to, and higher order derivatives, V • j = d x j x , and according to Eq. (Ill) we obtain 



d t ((x,t) = £d t z = -d x (£j x ) = -d x J (112) 

Therefore, the in-phase train of steps moves according to one-dimensional conserved dynamics via the current 
J = £j x . To make this relation more explicit we can write: 

J{d x Q = £j x (m x , m y ) = £j x (d x (/£,-l/£) (113) 

As an example, we will consider the current: 

j = K\7{V 2 z) + f(m)rh (114) 

which gives rise to the current J: 

J = £[Kd x {V 2 z) + m x f(m)] = Kd^{x,t) + /(m [l + (a,C) 2 ] 1/2 )^C(^, t) (115) 

where too = l/£. 

Since /(too) > the solution £ = is unstable and the critical wavelength is given by A c = 2tt ^ K / f (mo) ■ It may 
be interesting to compare the expression of J with what is found from a step-motion approach in the linear limit 
£ — > 0. It is clear that the dispersion relation relevant for the current J is U)(q) = a*q 2 — j3*q 4 , with a* = /(too) and 
/3* = K . The function / can be derived from the BCF theory in the limit to-o^d ^S> 1 (corresponding to the absence of 
nucleation on the terraces, see Eq. (14h). We find a\ — F £es£ 2 /[%{£ + ^es)]- Conversely, the step-motion approach 
(Eq. ([55])), in the absence of desorption gives ol* 2 = F £ 2 ES £ 2 /[2(£ + £es) 2 ] and j3* = Tp° q £D, which -for an infinite ES 



effect- reduce to the quantities a,/3 introduced in Eq. (108). Only in the same limit a\ and a\ are equal, while they 



differ for a weak ES effect. We have not been able to clarify the reason of this disagreement. 

G. Step flow and nucleation 

In this chapter we have seen that the steps of a growing vicinal surface are subject to a meandering instability, 
but do not undergo a step-bunching instability: the terrace width keeps constant and steps move in phase. In the 
introduction to the part on vicinal surfaces we raised the question of step-flow stability against 'three-dimensional 
growth'. The transition may be induced by nucleation: between two steps a new terrace comes up and —because of 
the ES effect— a mound structure tends to develop; on the other side, the upper advancing step tends to capture 
and embed the new island, therefore restoring step-flow. In a certain sense, the faster of the two processes will 
establish if step-flow is stable or not. Competition between step- flow and two-dimensional nucleation has been seen 



—for example— by Tung and Schrey [116| during Silicon epitaxy. 

The destabilizing phenomenon is not the nucleation event by itself, but the mound which can form on the new 
island. For a weak ES effect on a one dimensional surface, the probability to nucleate a second island on it before the 
incoming step captures the first one is of the order of (£/£d) 4 , a vanishing probability for £0 — > 00. In the opposite 
limit of an infinite ES effect, this probability is of order unity and therefore the instability eventually develops. 



This is clearly shown in simulations by Krug and Schimschak llOq], who also evaluate the time ti ns necessary for 



the instability to take place. They calculate tj n s fro m the condition that a mound is formed whenever an atom is 
deposited on top of a mobile adatom and find [108] t ins « tML^~ 2 (D/F ) 3 ^ 4 . We propose a different criterion: a 
mound is formed whenever a nucleation event takes place and therefore an island is formed. Since the nucleation 
probability per unit time on a terrace of size I (see App. g) is p6| , |66| P(£) ~ (1/£ml)(^/-^d) 4 , U na is nothing but 
l/P(£):t ms ^t ML (e D /£) 4 ^t ML £- 4 (D/F ). 

The previous estimates are not applicable to a two-dimensional vicinal surface because they neglect step meandering. 



Indeed this phenomenon has been shown [105 to be relevant for the late time behaviour of the stepped surface in the 
absence of desorption, whose morphology looks very similar to that obtained on singular surfaces. 

This is clearly seen in Fig. |l^ which shows the evolution of a vicinal surface: points of the same step touch and join, 
and defects are therefore formed. These ones give rise -because of the ES effect- to the formation of three-dimensional 
structures and eventually to the destabilization of the surface. 



VI. SHADOWING INSTABILITIES 



When a crystal is grown from a beam, the beam is generally not normal to the surface, but has an oblique incidence. 
Then, if a bump is formed, it may have a 'shadow' which influences the growth and contributes to making the growing 
surface unstable (Fig. 0d). The resulting profile is obviously asymmetric. In certain cases of technological interest, the 
deposit takes a columnar structure (Fig. Iq). The initial phase of column fo rmation may be viewed as an instability, 
and has been treated by the same analytical methods as other instabilities [ 118, 119 1, namely non-linear equations of 
the KPZ type. 

Columnar growth is usually observed when an intense ato mic beam is deposited on a fairly cold surface moving at 
a high velocity. This process is used in technological devices [12C-122], for instance for the manufacture of magnetic 
tapes. This technology is quite different from MBE, since neither the substrate nor the deposit are single crystals, 
and columns are separated by voids. However, in both cases ballistic deposition takes place from a beam of incoming 
particles. In MBE, one tries to avoid bump formation, so that shadowing effects are negligible. This result is obtained 
by a slow deposition rate at high enough temperature on a rotating sample. If these conditions are not realized, 
bumps or crevaces can appear as described in chapter |fV|, and then oblique incidence produces shadowing. 

The columns have a well defined direction which makes an angle j3 with the normal to the surface. This angle is 
an increasing function of the incidence angle a (Fig. Hq) . Experimentally, a simple relation 



tan 8 = — tan a 
2 



(116) 



is sometimes observed [123]. Its validity is limited to low temperatures [124]. Theoretical atte mpts to relate 8 to 
a have been made, using either the continuous (linearized) eq uations valid in the incipient stage |l24| , or geometric 



considerations app licable to fully developed columnar growth [125 126 1. The theoretical results are more complicated 
than formula ( 116 ) and suggest that j3 does not depend only on a, b ut also on the ratio of the beam intensity to the 
diffusion constant [124| or on the density of the deposited film [126]. It is noteworthy that the theories do not take 



the crystal structure into account, although the formation of columnar structures which can arise at normal incidence 
as a consequence of the Ehrlich-Schwoebel effect p6Lp8[ is related to the crystal structure. 

As mentioned above, the deposited film is porous and polycrystalline. However, the direction of the column is a 
symmetry axis of the crystals [121,126]. This property, which is of importance for technological use, does not seem 
to have received a theoretical explanation. 



VII. WETTING AND NON- WETTING 



In previous chapters it was assumed that the growing crystal is semi-infinite in the growth direction. However, 
for many applications, the growing crystal ('adsorbate') is deposited on a substrate which is a chemically different 
material. One generally wishes that the adsorbate forms a homogeneous layer with a planar surface. If it is so, it 
is said that the adsorbate wets the substrate, and layer by layer growth is possible. It is not always possible. The 
conditions of stability of a homogeneous adsorbate with a planar surface will be investigated in this chapter and in 
the following ones. 

In contrast with the previous chapters, where the instability was of kinetic nature, we shall now mainly discuss 
thermodynamics. 

A planar surface is stable at equilibrium if it minimizes the free energy. Alternatively, one can minimize the 
difference between the free energies of a state with a deformed surface (with bumps and valleys) and of the state with 



a planar surface, and the same mass of adsorbate and substrate. This difference will be called 'free energy increment' 
and must be positive for all deformations if the plane surface is to be stable. The forthcoming search for instabilities 
will be a search for negative free energy increments. 

A part of the free energy density excess is localized at the interfaces and surfaces. An obvious reason is the fact 
that chemical bonds should be broken to make a surface. This localized part is called capillary free energy, because 
it is responsible for the familiar phenomena which arise in thin ('capillary') pipes. It is present in liquids as well as 
solids. 

Another part of the free energy density excess which results from the creation of interfaces and surfaces is due to 
elasticity. It is delocalized, in the sense that it decays but slowly when the distance to the interface increases. It may 
be interpreted as resulting from a distortion of the chemical bonds in the whole system. It exists only in solids, and 
is especially important when the adsorbate takes the lattice constant imposed by the substrate. In this cas e, called 
coherent epitaxy, the topology of the crystal is preservedrj Since the misfit 5a/ a defined in paragraph I A is never 



completely equal to 0, the adsorbate is strained with respect to its natural size, and this generates elastic effects which 
will be studied in the next chapters. In the present chapter, this elastic contribution will be neglected, as it is correct 
in the case of liquids. Elasticity is presumably not very important in the growth of a solid, even on another solid, if 
this growth is incoherent (i.e. the crystal topology is broken) Pj 

If elasticity is ignored, the stability condition of a plane interface, established by Young at the beginning of the 
nineteenth century, is that the free energy per unit area a sg of the substrate-gas interface is larger than the sum of 
the free energy per unit area a sa of the substrate-adsorbate interface, plus the free energy per unit area d ag of the 
adsorbate-gas interface, namely 



'ay 



(117) 



If (117) is not satisfied, droplets form during growth. Their contact angle 9 is given by Young's formula 

cos = (a sg - a sa )/a a g (118) 

In certain cases (Volmer- Weber growth) droplets form directly on the substrate. This occurs for Pb on 
graphite(OOOl). In other cases (Stranski-Krastanov growth) a few complete layers are deposited before droplets 
form (Fig. |l7|). This occurs for Pb on Ge(lll) PJl27|. 



The droplet size is determined by kinetic mechanisms [12S[-13(|. In MBE growth, it is mainly limited by diffusion 



of the atoms deposited on the surface, which try to go to the nearest forming droplet. Since diffusion is slow, Volmer- 
Weber and Stranski-Krastanov droplets are small, much smaller than water droplets which are familiar in our everyday 
life. The size of liquid droplets is close to the equilibrium size, determined by gravity. They form through collective, 
hydrodynamic motions, not through atom diffusion. In the small solid droplets which arise from Volmer- Weber and 
Stranski-Krastanov growth, gravity is negligible and the droplet shape is fully determined by surface tensio n. In the 



case of a liquid, droplets would actually be parts of spheres, whose angle with the plane is determined by (118). In 



the case of a crystal, the shape can be determined from the Wulff construction fM. It is often rather spherical at 



high enough temperature, but it is faceted at low temperature, as mentioned in paragraph IB. 

Droplet formation caused by the interface energy can be regarded as a growth instability, but there is not much 
to say about it. If one wishes to grow a smooth adsorbate, non-wetting materials are just excluded. On the other 
hand, there are, to our knowledge, no technological applications of incoherent Volmer- Weber and Stranski-Krastanov 
droplets. However simila r dro plets can arise from a cause which is not the surface tension, as will be seen in the next 



chapters. From now on, ( |117| ) will be assumed to be satisfied, but elastic effects will be taken into account. 



VIII. COHERENT AND INCOHERENT EPITAXY: MISFIT DISLOCATIONS AND CRITICAL 

THICKNESS 



14 This statement applies to most of the cases considered in this review, but is an oversimplified generalization. For instance, 
coherent epitaxy of bec Fe on fee Ag is possible although the topology of the two lattices is different. Another example is MgO 



on Fe. The extension to these cases of the concept of coherence is straightforward 

15 Elasticity can be important for the mechanics of two solid 
or crack formation, independently of epitaxy and coherence 



15 Elasticity can be important for the mechanics of two solids glued together, since temperature variations can produce bending 



A. Misfit dislocations 

From now on, the attention will be focussed on the epitaxial deposit of an adsorbate on a substrate which has a 
fairly small misfit, say 5a/a < 0.1. As seen in the previous chapter, the adsorbate can be coherent or incoherent. 
For a sufficiently small misfit (to be precised below) the ground state of an epitaxial adsorbate of given thickness h 
is coherent if h is small enough. In this state, the crystal topology is that of a perfect crystal, in particular each 
atom has the same number of nearest and next-nearest neighbours which form the same geometrical figure with only 
slightly modified distances. 

The coherent state can become unstable if h is increased or for another reason. In the new, incoherent ground 
state, the crystal topology is perturbed at the interface. Far from the interface, a perturbation of the crystal topology 
involves a large energy (at least if h is large). Therefore, the crystal topology far from the interface will first be 
assumed to be the same as in the coherent state, although this assumption will be seen not to be always correct.! 1 ^ 



At the interface, it can be shown [131| that, if \Sa/a\ <C 1, the crystal topology is but weakly perturbed in large 
domains separated by lines. These lines, along which the perturbation is large and linear elasticity is not applicable, 
are called 'misfit dislocations'. 

These line defects are indeed dislocations of the coherent state. This can be understood if one measures the atomic 
displacements u and the strain {£07} with respect to this coherent state. Let us consider the variation 5u of u along a 
path joining the centers A and B of two neighbouring domains on the interface. If the path is through the substrate, 
Su = 0. However, along a path through the adsorbate, the variation of this displacement is the integral of the strain 
from the coherent state, and the components e xx and e yy of this strain parallel to the surface have a well-defined sign, 
that of 5a/ a. The integral can therefore not vanish. Along a closed circuit going from A to B through the adsorbate 
and then from B to A through the substrate, the variation b = 5u does not vanish. This is the typical property of a 
dislocation and b is the so-called 'Burgers vector' of this dislocation. 

An example of a misfit dislocation imaged by high resolution electron microscopy is shown in Fig. n8l 

If the crystal topology is perturbed only near the dislocation line (in the so-called dislocation 'core') the Burgers 
vector has to be a lattice vector and the dislocation is called a perfect dislocation. It turns out that, if h is not very 
large, the state of lowest energy often involves 'partial dislocations', characterized by a Burgers vector which is a 
rational fraction of a lattice vector. Partial dislocations are the edges of two-dimensional defects, e.g. stacking faults. 
For many crystal structures, e.g. the face-centered cubic lattice, stacking faults have a rather low energy, while the 
energy of a dislocation of large Burgers vector is high. Thus, it can be preferable to create partial dislocations. The 
low energy of stacking faults often results from the fact that the environment of each atom is weakly perturbed up to 
nearest neighbours, and the crystal topology is only modified if next-nearest neighbours are taken into account. 

An example is provided by intrinsic stacking faults generated by Shockley partial dislocations with b= (1/6) [112] 
in FCC lattice (Fig. |l|). 

In an ordered alloy, the two-dimensional defect related to a partial dislocation can be an antiphase boundary rather 
than a stacking fault. 

Misfit dislocations have the topological properties of ordinary dislocations. An essential difference is that they can 
be present in the ground state because their energy can become negative. Actually, the free energy associated with a 
dislocation can be written as a sum of 4 terms, 

*>tot — Jdisloc ~r •* step t J fault T J disloc/ film \*-^J 

The first 3 terms are also present in a pure system and are usually positive. The first term Tdisioc is the volume 
integral of the elastic energy associated, for 5a/ a = 0, to the strain produced by the dislocation. The second term 
Tgtep is the free energy of the step which is necessarily created (or, exceptionally, destroyed) if the Burgers vector 
has a vertical component (i.e. normal to the interface). The third term J- f au it , only present in the case of a partial 
dislocation, is the free energy of the stacking fault or antiphase boundary created by the dislocation. 

The last term fdisloc/film is negative. It is the elastic energy gain resulting from the strain relaxation in the 
adsorbate when the dislocation is created. It is responsible for the presence of misfit dislocation in the ground state. 
The free energy Ttot depends on the thickness h of the adsorbate and becomes negative when h is greater than a 
threshold h c called 'critical thickness', calculated in the next paragraph. 



16 In other words, the attention will first be focussed on perfect dislocations, while partial dislocations, to be denned below, 
are often created when h is not yet very large. 



B. Critical thickness 

Nucleated dislocations and threading dislocations 

Misfit dislocations can originate from two different mechanisms, i) nucleation of dislocations on defects, surface 
etc. and ii) propagation of preexisting dislocations of the substrate (threading dislocations). The density of threading 
dislocations may be very low if the substrate is of good crystalline and surface quality, and the nucleation of new 
dislocations is not easy since it may involve a high energy barrier. Thus, the thickness of the coherent adsorbate may 
be larger than the critical thickness calculated from thermodynamics. Moreover, the dislocations which are observed 
are not necessarily those which have the lowest energy, but those which correspond to the lowest activation energy, 
as will be seen later. Nucleation of dislocations at the surface is of particular importance in this review. 

Climb and glide of dislocations 

In the case of dislocations which are nucleated at the surface, they have to propagate down to the substrate- 
adsorbate interface, and they have a priori two possible ways to do that, which are called 'climb' and 'glide'. Climb 
implies mass transport by diffusion in the bulk and is generally possible only at high temperature. In Fig. Ell an 
edge dislocation with a Burgers vector parallel to the interface (the best way to relax the elastic energy) has been 
introduced by climb from the free surface. 

In usual growth conditions, only glide of dislocations is possible. On the other hand, in the case of layer by layer 
growth, the epilayer is continuous and the glide plane cannot be parallel to the interface. A consequence, which might 
be seen from geometry, and also results from the formulae given below, is that b is in the glide plane. On the other 
hand, in the case of layer by layer growth, the epilayer is continuous and the glide plane cannot be parallel to the 
interface. This implies that the Burgers vector has a vertical component (Figs. El] and H3). As will be seen, this 
component does not contribute to the relaxation, but has a cost in energy. 

Critical thickness for the equilibrium state. 

One can imagine two extreme definitions of the critical thickness, i) In a strictly thermodynamic sense, one might 
state that the critical thickness is reached as soon as one can find a dislocation of any kind which has a negative 
energy per unit length. This approach would lead to a strongly underestimated value of h c . ii) A complete study 
would evaluate all energy barriers which correspond to all possible dislocations. This would be rather tedious. 

Our approach in this paragraph is intermediate. The critical thickness h c will be defined as the thickness beyond 
which the energy per unit length Utot(h) of a dislocation lying at the interface is negative, provided this dislocation 
can be created by glide. This definition discards dislocations with a Burgers vector parallel to the surface plane, which, 
as will be seen, have a lower energy but can only be created by climb, an extremely slow process. As will be seen in 
chapter K], it may happen that, on a planar, high symmetry surface, dislocations do not appear, presumably because 
their nucleation energy is too high. Instead, the adsorbate forms bumps or clusters, and only then, dislocations 
appear, probably because surface irregularities allow the formation of dislocations of lower energy. 

The critical thickness h c is given by 

Utot(h c ) = U dls i oc (h c ) + sj + U fault + U disloc/fUm {h c ) = (120) 



where the 4 terms correspond to the 4 terms of (|119| ), 7 is the step energy per unit length, which has to be respectively 
added (s=l) or subtracted (s=-l) if creating a dislocation emerging at the surface creates or destroys an atomic step. 
The case s=-l, where the dislocation has nucleated on a preexisting step, lowers the critical thickness. However this 
process seems unlikely because the step would have to be a straight line whose direction is imposed by the intersection 
of the glide plane and the surface. A schematic representation of a dislocation propagating in the glide plane is shown 
on Fig. g2[ The glide system is defined by the angle (3 between the Burgers vector and the dislocation line lying on 
the interface between the substrate and the epilayer, and the angle cf) between the glide plane and the surface of the 
epilayer. The procedure is i) to evaluate the dislocation energy and the nucleation barrier for all possiblej 17 | values 
of (3 and <fi, and ii) to look for the values of these parameters which yield the lowest energy for a reasonable barrier 
height. The step (ii) will not be carried out in detail here. 



7 Here, 'possible' means that these parameters correspond to an actual glide system. 



In this paper, we will present only the case where the substrate and the epilayer are isotropic elastic media with 
identical Poisson ratio v and Young modulus E. The self-energy per unit length of a dislocation lying at the interface 
may be written [132| as 



Udisioc{h) = Eb 2 



l — i/ cos 2 (3 ah 
8tt(1 - i/2) n( T 



(121) 



where a depends on f3 and (f>, and on the core of the dislocation, a region where linear, continuous elasticity does not 
hold, since it would predict infinite strains and stresses. The value of a is controversial. The estimates for (3 = 60° 
and b — (1/2) < 1 10 > lie between 0.6 [133] and 2.0 1 134 ] . In works related to Si-Ge straine d lay er structures, a 
is often taken [132 1 equal to 4. The expression of the dislocation energy in Matthews' paper [135] corresp ond s to 
In a = 1. A precise expression of the energy of a dislocation near a free surface has been obtained by Freund [136|. It 
relates Udiaiocih) to <f>, the core radius r c and the core energy i7 c ore whic h can be evaluated in semiconductors, using 



atomistic simulations based on empirical interatomic potentials [137,13£ 

The third term of (120), present in the case of a partial dislocation only (Fig. 
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(122) 



where &f au it is the ene rgy per unit area of the stacking fault or antiphase boundary associated with the dislocation. 
The fourth term of (120) is negative and proportional to the misfit, 



U, 



dislocf film 



(h) = 



-E 
l-i/ 



hb sin (3 cos < 



(123) 



The factor 6sin/3cos</) can be justified as follows. Let the y axis be chosen parallel to the dislocation, z normal to 
the interface, and x perpendicular to z and y. With respect to the free material, the coherent adsorbate is compressed 
(if 8a/ a > 0) or stretched (if 8a/ a > 0) in the directions parallel to the plane. The effect of a misfit dislocation is to 
relax a part of this constraint. The amplitude of the relaxation is measured by b x = b sin (3 cos <p. The components b z 
and by measure respectively a strain perpendicular to the interface, and a shear, which both do not contribute to the 
relaxation]^ 

Using formulae (121) to (123), the equation for the critical thickness (|120|) can be rewritten asp] 
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(124) 



In Fig. E3L the dependence of the critical thickness on the misfit is displayed in the case of a perfect dislocation 
(1/2)[101] with the glide plane (111) in a FCC lattice (case of Fig. [p. For this glide system, (3 = 60° and <j) = 54.7°. 
The step energy has been neglected. The two values of a used in Fig. |23| correspond to two typical values of the 
core energy, which depends on the material. The critical thickness is seen to decrease rapidly with increasing misfit. 
For a misfit of about 4% the critical thickness reaches the lattice parameter and continuous elasticity theory must be 
replaced by calculations taking into account the discrete nature of atomic layers. 

In metals the predicted and experimental values of h c agree fairly well and are not very large in most of the cases 
which have been studied. However the critical thickness may be much larger than the prediction in some materials. 
For example, under certain growth conditions, Ge/GaAs remains coherent up to 2 /im, while the theory gives 300 
nm [139]. This discrepancy is presumably related to the large value of h c . 



18 A detailed calculation wou ld make use of formula (149) of chapter DO. 

19 If the denominator in (124) is negative, the critical thickness for the particular type of defect of interest is infinite, and that 



type of defect is discarded because the energy cost of the fault is larger than the gain due to relaxation. 



C. Nucleation of a dislocation at the growing surface 

If the density of preexisting dislocations in the substrate is not sufficient to relax the stresses during growth by the 
extension of threading dislocations, dislocations must be nucleated, and this involves an activation barrier. Possible 
sources for dislocations are: 

- Homogeneous nucleation of half-loops (whole or partial) at the free surface of the epilayer. 

- Homogeneous nucleation of half-loops at the substrate/epilayer interface. 

- Heterogeneous nucleation of complete loops at a growth defect in the bulk of the epilayer. 

- Nucleation of half-loops at the edges of islands in 3d growth or at the free surface defects. 

- Multiplication of dislocations, for instance by cross-slip. 

The homogeneous nucleation of a dislocation half-loop at the free surface (Fig. E4) is energetically more favourable 
than the homogeneous nucleation of a complete loop at the substrate/epilayer interface |140|| . The formation e nerg y 



Ttot of a h alf-loop of radius r on a plane inclined at an angle <j) with respect to the surface has the form (119) 



where [ 141 



The self-energy is approximately equaPI to 



E {1-vjl) rb 
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T dlsloc = g(1 _V 2) In(-) (125) 

• The step energy is 

T st ep = 2rsj (126) 

where s = 1 if a step is created, s = — 1 if a step is annihilated. 

• The stacking fault free energy (in the case of a partial dislocation) is 

9 

7r r _ 

F fault = —Z~ O 'fault (127) 

• The elastic energy relieved by the loop is the work done by the biaxial stress {<r a7 } applied to the area of the 
loop for a displacement equal to the Burgers vector b : 

2 

Fdisioc/fiim = -Area x ^n a>7 cr a7 6 7 = — bcr sin/3 sin0 cos0 (128) 



ot-y 



where n is the unit vector perpendicular to the area of the half loop. Near the surface, the n on-vanishing stress 



components are only a xx — a yy — er, proportional to Sa/a as seen below from formula (131) 



In addition to the factor sin/3 cos</>, already present in (123), formula (128) contains the factor sin0. Thus, only 



inclined glide planes (cf> ^ or tt/2) can be invo lved in the nucleation process. 

As r increases from 0, the total energy (|l!9| ) of the half-loop increases, reaches a maximum for r = r c and then 
decreases. The critical radius r c is then given by dTtot/dr = or 

r _ E (1 - v/2) b 2 [1 + ln(ar c /b)] + 16 (l - v 2 ) s 7 
8 7r (1 — v 2 ) (a b sin/3 sin <f> cos <j> — 5 fault) 

This gives the energy barrier for dislocation nucleation, 

y.-r.,,- ^"-'^?^- 1 ' (130) 

16 (1 — v*) 



There is a close parallelis m b etween the formulae of this paragraph and those of paragraph VIII B. For instance, formula 



(125) can be obtained from (121) by multiplying by the half-loop length nr and replacing cos /3 by 1/2 and h by r. 



At the beginning of the relaxation, the biaxial stress for an isotropic material is 

-E 5a 
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v a 



(131) 



If 7 and a fault are neglected as well as logarithmic factors, insertion of ( |129| ) into (13C) shows that T c varies a s b 3 . 
This favours nucleation of partial dislocations, which have smaller Burgers vectors. For instance, Dynna et al. [ 142 ] 
calculated the loop energy as a function of the loop radius for the epitaxial system AusoNi2o/Au(001) for Shockley 
partial 90° dislocations (1/6)[112] and perfect 60° dislocation (1/2)[101] (see Fig. 25). At this composition, the misfit 
is 2.3%. The energy barrier for the nucleation of partial dislocations is clearly lower than that of perfect ones. This 
result may explain why stacking faults and twins have been observed in this system and similar ones. 



D. Relaxation by an array of dislocations 



Above the critical thickness, it is energetically favorable to introduce dislocations. The number of dislocations is 
limited by their interaction and depends on the substrate thickness. The epitaxial strains are progressively relaxed 
by the formation of a lattice of dislocations located at or near the interface between the epilayer and the substrate. 
The equilibrium density of dislocations is reached when the total elastic energy Ttot is minimum. If, for the sake of 
simplicity, the step and fault energies are neglected, Ttot can be written as the sum of the elastic energy Tfu m of the 
homogeneously strained film, the self-energy of the dislocation lattice T disioc-, the interaction T&. 
dislocations and the film, and the interaction T disioc / 'disioc between dislocations. Thus 
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(132) 



In homogeneous and isotropic materials, the first three terms are rather simple, and can be written readily. Indeed, 
T film is the elastic energy of a film strained biaxially by the substrate : 
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(133) 



The self-energy and the dislocation/film interaction energy are proportional to the quantities V T disioc and Udisloclfilm 
given by (pi)) and fll23|). 

In the case of an epilayer relaxed by two orthogonal periodic arrays of dislocations, one has 



2A 

J~ disioc ~r J~ disioc/ film — ~~ "T" \y disioc ~r U disioc/ film) 



(134) 



where d is the average distance between dislocations and the factor 2 appears because of the 2 dislocation arrays. The 
most complicated term in (132) is the interaction between dislocations, Tdisioc/ disioc- It depends on the distribution 



of the dislocations at the interface, which can be periodic or not. Furthermore, due to oblique glide plane, the Burgers 
vector has a component perpendicular to the surface. This component does not contribute to the relaxation. Two 
possible arrays of misfit dislocation may be considered: 



1. An array of dislocations characterized by the two angles (3 and <fi defined in paragraph VIII B, all dislocations 
having the same orientation. 

2. An array of dislocations characterized by (3 and <f> in which those components of the Burgers vector not relieving 
misfit strain alternate in sign. This case is represented in Fig. Eq. 

Exact expressions have been derived for t he energy of an infinite array o f ide ntical periodic dislocations taking into 
account the interactio ns be tween them [143]. Using this theory, Jain et al. [ 144 1 calculated the strain relaxation. More 
recently, Dynna et al. [145| gave a closed for m of the total energy for an alternating dislocation array. Their evaluation 
was based on Head's analytical calculation |14(| of the stress field for a dislocation in a semi-infinite, isotropic body. 



Their results will be presented below, but let us begin with the earlier theories [139,147,148] which do not take into 
account the inhomogeneous part of the stress and strain fields associated with dislocations interactions. The total 
energy is written as the sum of the elastic energy of the epilayer homogeneously strained to e res (residual elastic 
strain) and the energy of the set of dislocations. 



•*tot — J~homo\£r 



Pdisloc C* disloc-A 



(135) 



where Thomo is the energy of the homogeneously strained epilayer and 
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(136) 



is the dislocation density. Thomo is similar to Tfu m in (I133J), but the misfit 8a/ a must be replaced by the residual 
strain e res . The difference between J-homo and J-fu m takes into account both interaction energies T L 



disloc/ film 



and 



3~ 'disloc /disloc m a mean field way i.e. on ly th e homogeneous part of the strain and stress due the dislocation array is 
considered. Then using (135, 136) and (123) the total energy can be written as : 
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where the cut-off radius, R cu t is the minimum value of the thickness h and the interdislocation average distance d. 
Fig. |27] displays the energy versus the residual strain (lower axis) and dislocation density (upper axis) for several 
adsorbate thicknesses. Below the critical thickness, the energy exhibits a singular minimum for a residual strain equal 
to the misfit and zero dislocation density. Above the critical thickness, the equilibrium occurs at e eq for a finite 
dislocation density. The critical thickness corresponds to the curve with a horizontal tangent when the residual strain 
is equal to the misfit. 

When the inhomogeneous part of the stress and strain fields generated by the dislocation array is taken into account 
to calculate the interaction energy, the expression of the total energy is much more complicated. We present here 
an example of such a n expression for the case of an epilayer relaxed by two orthogonal periodic arrays of alternating 
dislocations [ 142,149) (Fig. ^6|). The interaction energy is : 
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where the factor 2 arises from the two sets of orthogonal dislocations, V^™ oc arrav is the interaction per unit length 

of one dislocation with all other parallel dislocations and E^Ksioc array 1S ^ ne interaction of one dislocation with all 
perpendicular dislocations. Their expressions are 
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and 
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Such an expression has been used to calculate the relaxation of MgO grown on the (001) surface of bcc iron 
at room temperature [149|. In this system the misfit is about 3.8%. The results of the measurements and the 
calculations are presented on Fig. |28|. The open circles correspond to the experimental values of the relaxation as 
measured by electron diffraction (RHEED) during the growth. The glide system consists of perfect edge dislocations 
of Burgers vector (1/2) < Oil > gliding on ( 101) planes. The curve called 'equilibrium calculation' corresponds to 
the minimization of the total energy J- tot (eq. 132) with respect to the distance d between dislocations. 



It is clear that the calculated relaxation within the equilibrium model is overestimated. The nonperiodic nature 
of the real dislocations arrays may explain this discrepancy. Indeed, even if the nucleation is homogeneous and the 
initial dislocation distribution is periodic, it is difficult to maintain the periodicity as the thickness and the number of 
dislocations increase. The periodicity can be maintained only if all existing dislocations move continuously to smaller 



spacings to make room for new dislocations. It has been shown [150] that this process requires more energy than the 



introduction of new dislocations in a nonperiodic way. Numerical calculations have been performed by Jain et al. 



[144 1 involving a uniform-random or a Gaussian dislocation distribution showing that the energy of the dislocation 
arrays is increased with respect total energy of periodic arrays having the same average dislocation density (i.e. the 
same relaxation ratio). This leads to a lower or slower relaxation rate. However, the real distribution of dislocations 
depends on the mechanism of nucleation and propagation. The nucleation may be the limiting process, when no 
threading dislocations are expected from the substrate of high quality. Nucleation plays also an important role in 
selecting a particular glide system of dislocations. The propagation of dislocations may also be the limiting process 
due to inhomogeneities of the stress field present in the layer where the dislocations are moving. The inhomogeneities 
of the stress field may result in a retaining force which may block the dislocation propagation. Such a phenomenon, 
when the source of inhomogcneous stresses are the other misfit dislocations, is called 'blocking mechanism'. It is the 
subject of the following paragraph. 

E. Blocking mechanism 



Just above the critical thickness, dislocations propagate to form two arrays of orthogonal dislocations. Freund [136] 
has shown that the interaction between a moving dislocation T>\ and an orthogonal dislocation T>2 impedes the motion 
of the segment T>\. Then this results in a lower relaxation rate than predicted by previous equilibrium calculations. 
This is shown schematically on Fig. £9]. A moving dislocation T>\ encounters a misfit dislocation T>2- The stress field 
generated by the blocking dislocation can completely suppress the traction force due to the residual stress in the 
cpilayer. To bypass the blocking dislocation, T>\ may alter its path by moving in the same glide plane but closer to 
the surface i.e. in a channel of width h* < h. This reduction of the thickness reduces the driving force acting on T)\. 
Furthermore, the retaining line tension increases due to interaction with the surface (image forces). When T>\ passes 
above 2?2, three forces are acting on it : 

- the driving force due to the residual homogeneous strain e res 

- the retaining force due to the line tension of T>\ 

- the interaction force associated with the stress field resulting from T>2 ■ 

Let us express the retaining forces acting on the element of T>\ in terms of the residual strain needed to just 
compensate these forces. 

i) The residual strain eu ne which exactly compensates the dislocation line tens ion can be found readily within the 



homogeneous strain approximation of the relaxation by minimizing Ttot in (137) with respect to e res , and replacing 



h and R cu t by h* (which corresponds to neglecting interactions between dislocations) : 

(l-,cos 2 /3)frlog(^f) 
»7r (1 + v) h* snip coscp 

ii) The residual strain, e res compensating the stress field ctd(x,z) of 2? 2 can be expressed in terms of the forces 
resolved in the glide plane acting on an elementary segment of the dislocation: 



O res,a*y ">-] 



o\d, Q7 n 1 b a + } cr res , al n 7 b a = (142) 



where n — (l/v^) < HO > is the glide plane normal unit vector and n = (1/2) < 110 > the Burgers vector. The 
homogeneous biaxial stress tensor is related to the residual strain by : 

(143) 

Now, the residual strain compensating the stress due to the blocking dislocation T>2 for an elementary segment of 
T>\ passing at the position (x,h*) may be written for this glide system as : 

e res (x, h*) = — - — [(u- 1) <7 D ,xx(x, h*) + va D ^ zz {x, h*)\ (144) 




To bypass the orthogonal dislocation stress field in the channel of width h* , the residual strain eo(h*) has to be 
greater than e res (x, h*) for every position x. Then eo{h*) is the maximum value of e res (x, h*) when x is varying, this 
condition may be written : 

eo(h*) = e res (x max , h*) (145) 



Ox 



= (146) 



Since linear elasticity theory has been used, the two residual strains required to balance both retaining forces should 
be added to obtain the total residual strain as a function of the channel width h* . Since the channel width is a free 
variable, the actual residual strain below which the moving dislocations are stopped is : 

train = Mui h « [e D (h*) + e line (h*)] (147) 



Relations ( |145| ), (146) and ( |147| ) constitute Freund's blocking criterion. They mean that the residual strain corre- 



sponds to a saddle point of the function e res (x, h*) + eu ne {h*) in the two-dimensional space (x,h*). It has been found 



[151 1 that Freund's blocking criterion predicts a residual strain which is in good agreement with the strains found 



experimentally in Sii-^Gcj, film on Si(OOl). We have applied this blocking criterion to the relaxation of MgO g rown 



on Fe(OOl) presented above ]149| . The expressions of th e stresses due to the blocking dislocation may b e der ived [15C1 
from the Airy stress function as done by Freund |13£] or more directly obtained from Dynna et al. [145|. Fig. |30| 



displays the residual strains e_o(/i*), eu ne (h*) and the sum of these, e to t{h*) for a 100 A layer of MgO deposited on 
Fe(001). Both residual strains behave as expected. The residual strain compensating the line tension is maximum 
near the surface and then decreases when the channel width, h* increases. On the contrary, the residual strain com- 
pensating the orthogonal dislocation stress field increases as the channel width approaches the layer thickness h. The 
minimum of the total residual strain occurs at about h* — 70 A and e tot — 1.5%. Then at this layer thickness, the 
expected relaxation is estimated to 2.1% (the misfit between MgO and Fe is 3.6%). This value is in good agreement 
with the experimental one. On the Fig. |28|, the relaxation calculated from the blocking criterion is represented. It 
is in better agreement than the equilibrium one presented above at least for high thicknesses. Although Freund's 
criterion predicts no relaxation when h < 27 to 30 A, relaxation is clearly observed experimentally when h > 20 A. 
This relaxation is possible because the density of blocking misfit dislocations is insufficient to block all of the threading 
segments present in the MgO film. Above 50 A the agreement between theory and experiment improves continuously 
and the agreement is excellent at about 100 A. The experimental results presented in Fig. |2q correspond to room 



temperature growth. For layers grown at higher temperature, the relaxation is faster [149 150 1. This can indicate 



that nucleation is enhanced or that the blocking barrier is overcome by thermal activation. This example shows that 



other aspects of relaxation must be taken into account, e.g. the frictional stresses [152], the non-periodic nature of 



the dislocation array [144] or the multiplication mechanism [153]. The prediction of the relaxation rate depends on 



the growth conditions and is currently an active field of research. 

IX. THE ASARO-TILLER-GRINFELD INSTABILITY AND RELATED PHENOMENA 
A. Three-dimensional clusters as an alternative to misfit dislocations 

1. Experimental facts 

If a crystalline material is coherently adsorbed on a substrate which has a different atomic distance, it does not like 
it and tries to escape from this unpleasant situation which costs energy. There are several ways to escape. The most 
usual way is to make misfit dislocations (Fig. |3l|a) as seen in the previous chapter. However, mismatched epitaxial 
films can relax their strain through a different mechanism, namely a deformation of the surface or '2d-3d transition'. 
In III-V semiconductors, for example, this frequently occurs for a lattice mismatch above typically 2% in layers under 
compressive stress and for lower mismatch in layers under tensile stress. If the misfit is moderate, this deformation 



leads to alternating hills and valleys (Fig. |3l| b and 32) while for a stronger misfit, the adsorbed film splits into clusters 
(Fig. |33| ). These clusters will be studied in the next chapter. 

The base of each hill is still highly constrained by the substrate, and the elastic energy density is high. However, 
at the top and on the sides of the hills, the adsorbatc lattice constant is closer to that of the free adsorbate: the 
adsorbate is said to be 'relaxed'. The adsorbate is still coherent although dislocations always form in the later stages 
of the growth. The gain in elastic energy at the top of the clusters is the reason of the deformation, as will be seen 
more quantitatively in this chapter and in the following one. 



For a given substrate and a given adsorbate, stress relaxation can occur through misfit dislocations or, without 
loss of coherence, through surface deformation according to the growth conditions. For instance, in InAs/GaAs or 
InAs/In P systems , cluster formation precedes dislocation formation on the (100) substrate, a nd on several high index 



surfaces [ 155- 157| . On the contrary it is not observed on the other low index surfaces, e.g. [1581 (110), (lll)A and 



111 )B. A Ge deposit forms coherent clusters on Si(100), while it directly forms large incoherent clusters on Si(lll) 



[ 159 1 . The substrate orientation obviously determines the surface energetics and kinetics. This hesitation of nature 



between the two kinds of instability has been analyzed for instance by Joyce et al. [158| and is not very well understood. 
Nevertheless, we will try to understand a part of these phenomena. 

2. Energetics 

It is of interest to wonder about the ground state of a coherent, epitaxial film, even though it is never reached. 
Let us first ignore the interface energy and assume that there is only a surface energy and an elastic energy. For a 
large thickness, the elastic energy is proportional to the volume and dominates the surface contribution, so that the 
adsorbate tends to separate from the substrate and to form a single big cluster almost detached from the substrate 
(Fig. HU, last picture). This state obviously minimizes the elastic energy, and therefore the total free energy. 

In reality, the ideal ground state can of course not be reached since the atoms would have to diffuse on very long 
distances, but, if the deposit is thick enough, big clusters can form and lower the energy. 

The waves of figure pip and 32 are presumably the incipient stage of the instability which leads to isolated clusters. 

However, it turns out that big, coherent cl usters (Fi g. p4|, last two pictures) are never observed. Dislocations appear 



in the cluster when it reaches a critical size 1 16C - 163|] (Fig. |33| ) but the above argument helps us to understand why 
there is an instability, and also to predict the evolution of a population of clusters -a problem which will be addressed 
in chapter [X|. Moreover, even though shapes corresponding to the last two pictures of Fig. |j are neve r observed, the 



shape of coherent clusters does depend on the size, in contrast with incoherent clusters of chapter VII. In particular, 
it will be seen in the next chapter that small clusters, dominated by the surface energy, are 'platelets' (Fig. [34|, first 
picture). 

To close these qualitative remarks on energetics, it is appropriate to consider the interface energy. When one wishes 



to make good epitaxial layers, one needs to have a strongly negative interface energy a sa , so that (117) is satisfied. 
This highly negative value is obtained by the choice of the materials, of the temperature, the fluxes, the growth rate, 
etc. If the interface energy is sufficiently negative, the fi rst a dsorbed monolayer(s) is/are stable even if the misfit is 
fairly strong. This stable layer is called a 'wetting laye r' |]164| . It is sometimes related to complex chemical reactions, 
e.g. in InAs on GaAs, where a (In,Ga)As alloy forms |158f| . 

In the simplest cases, the existence of the wetting layer implies only a small modification of the above discussion. 
Cluster formation and wavy deformations do lower the free energy, but only after the wetting layer has formed. This 
picture applies to Ge/Si. In more complicated cases, there is experimental evidence that a considerable transfer of 
matter takes place from the wetting layer to the clusters. This seems to be the case in InAs/GaAs(001). This feature 



can be taken into account by simple theories [165| but, for the sake of simplicity, it will be ignored in the remainder 
of this review. 

3. Kinetics 

The thermodynamical analysis is probably not sufficient to determine the structure of an epitaxial deposit. Kinetic 
factors are presumably often essential, for instance for the choice between misfit dislocations and a three-dimensional 
modulation. The crystal chooses the fastest relaxation path in the phase space. The formation of misfit dislocations 
requires, as seen in the previous chapter, a purely microscopic atom motion inside the material, on a distance of the 
order of the atomic distance. In contrast, the formation of a three-dimensional structure involves atomic diffusion 
on the surface (which is much easier than in the bulk) but on much larger distances, typically hundreds of atomic 
distances. It is difficult to say which mechanism will be preferred. 

A way to make the discussion more quantitative is to calculate the activation energy. That of misfit dislocations 
has been calculated in the previous chapter. The activation energy for the formation of a coherent cluster will be 
evaluated in the next one in the case of a singular surface. 



Jf.. Singular and non-singular orientations 
In this chapter, the onset of the 2d-3d instability will be studied in the only case where a quantitative theory 



exists, [166,167] namely the case of a non-singular surface (T > Tr, see chapter |). 

Unfortunately, most of the experiments and most of the technological applications, are concerned with (001) surfaces 
which are believed to be singular at the growth temperature. Nevertheless, experiments on non-singular surfaces would 
be feasible, and this justifies the forthcoming study. Moreover, the wavy deformations observed in weak misfits are 
in qualitative agreement with the theories of this chapter, as will be seen. 

It will be shown in the present chapter that a plane surface of non-singular orientation is unstable with respect to 
spontaneous deformations of wavelength larger than some value A c . This value will be evaluated. This instability of a 
non-singular surface will be found to involve no activation energy, although the time of establishment of the instability 
will be seen to be very long for small misfits. The case of singular surfaces, e.g. the (001) surface of body centered 
cubic crystals, will be investigated in the next chapter and their instability will be seen to be subject to an activation 
energy. 

B. The Asaro-Tiller-Grinfeld instability: thermodynamics 



The stabil ity o f an initially planar s ystem w as investigated first by Asaro and Tiller [166| and later in great detail 
by Grinfeld g67J and other authors |p|, ^68| , |l69| . 



The simplest possible approach is the following: one calculates the energy of the system (substrate + adsorbatc) 
when its surface has a weak sinusoidal plastic modulation of amplitude Sh (Fig. paa) and compares this energy to 
that of the planar surface (Sh = 0). Here, the word 'plastic' means that the height modulation results from an 
irreversible modulation of the number of atoms, in contrast with an elastic strain.F] If the modulation produces an 
energy decrease, the planar surface is unstable. The method is in fact applicable to any weak modulation, because 
such a modulation can be Fourier transformed, and the strains produced by the various Fourier components below 
the modulated region can be calculated by linear elasticity theory and can just be added as in any linear problem. 
This additivity rule is not applicable in the modulated region but it does not matter since this region is very thin. 
This approach, in which the problem becomes linear, is called linear stability analysis. This method has already been 
used in chapters IV and |y|. 



In this approach, the height of the surface can be assumed sinusoidal, namely 

z(x, y) = (z) + Sh cos(qx) (148) 

where q is a real, positive number. 

Before going further, one should mention that the assumption of a sinusoidal, and therefore continuous modulation 
is acceptable only above Tr. Below Tr, continuous height changes are not possible as noticed in chapter |[ In the 
remainder of the present chapter, the temperature T will be assumed higher than Tr. The case T < Tr, which 
presumably corresponds to a defect-free, high symmetry crystal surface, will be add ressed in the next chapter DO. The 



applicability of the various theories to real surfaces will be addressed in paragraph IX D 



It will be seen that (if gravity is neglected) the modulation (148) lowers the energy if q is small enough, so that the 
planar surface is always unstable. 

This energy gain comes from the fact that, in the modulated part of the adsorbate, one can strain the material, so 
that its lattice constant becomes closer to the one it would like to have (i.e. that of the free adsorbate). 

The free energy can be most conveniently written if the strain is measured with respect to the completely strained 
state, in which the lattice constant in the x and y directions is that of the substrate£J If it were measured from the 
free adsorbate, it would be discontinuous at the interface. The free energy change resulting from the strain can be 
written as 

STi = -Const x — / d 3 r[e xx (f) + e yy (r)] (149) 

a J A 



21 The word 'plastic' is often understood as implying dislocation formation. It is not the case here. 

22 If it were measured from the free adsorbate, it would be discontinuous at the interface. Thus, the quantity denoted e in 



this chapter and the following ones was called 8a/ a + e re3 in chapter VIE 



where J. denotes the integral over the adsorbate. The strain e a -y(r) is measured with respect to the completely 
strained, coherent state, in which the lattice constant in the x and y directions is that of the substrate. A positive 
misfit corresponds to an adsorbate bigger than the substrate, so that the coherent adsorbate is compressed by the 
substrate (in the directions parallel to the surface) . Relaxing this compression corresponds to a positive strain (from 
the completely strained state) and to a negative 8T\. The co nsta nt depends on the elastic constants of the adsorbate. 
The integral on x and y vanishes for complete layers, so that ( |149| ) is proportional to the thickness Sh of the modulated 
region. It is also proportional to the strain modulation amplitude e, the misfit and of course to the area A. Therefore 



8T X = -dEJh—eA 

a 



(150) 



where E a is a typical elastic constant (e.g. the Young modulus) of the adsorbate and C\ is a positive constant 
of order unity. The detailed calculation for an isotropic elastic mediumFj is available in original articles and text- 
books PJl6r^l69[]. The result is 



d - 1/(1 - i/„) 



(151) 



where v a is the Poisson ratio of the adsorbate. 

The energy gain (150) is partially compensated by the quadratic part of the elastic energy, which can be written as 



<*,7>S>C 
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(152) 



where a, 7, £, £ =x,y,z and fl^ *,, are the elastic constants, which are (known) functions of r because they are not 
identical in the substrate and in the adsorbate. 

The exact minimization of the total elastic energy SJ-\ + 8T1 would be a hard task, on which mathematicians of the 
nineteent h century , e.g. Green and Dirichlet, have concentrated their efforts. For a small modu lation Sh, the problem 
is easy ]3, 166- 169 1. The strain which minimizes ST\ + ST2 when the surface is given by (148) is expected to be the 
real part of an expression of the form 



^07 l^x, y , z j 



e° 7 exp(igx)/(z) 



where e° and f(z) can be determined from the equations of elasticity which have the form |l7l| 

5>IW^ c = o 

7«C 



(153) 



(154) 



According to (153), the operator d x should be replaced by iq and d y by 0. Therefore, ( |153| ) can be solved by 
replacing d z by a^q, where the constant ckq depends on the elastic constants, and should have a positive real part 
since the perturbation has to vanish for z — — 00. In the case of an isotropic elastic medium, ao = 1 [PMI- in a ^ cases, 
the penetration depth of the perturbation is proportional to the wavelength 2ir j q. 

The quadratic part of the elastic energy in the bulk is thus proportional to 1/q. In addition it is proportional to e 2 
and to the area A. Finally, 



5T 2 = C 2 Ee 2 A/q 



(155) 



where E can be chosen as the Young modulus of the substrate, and the constant Ci is of order unity. If the wavelength 
is much longer than the adsorbate thickness h, Ci does not depend on the elastic constants of the adsorbate. 
In the case of a isotropic solids P,0| , 



EG, 



E 



l-v 2 



(156) 



where v and E are the Poisson ratio of t he ad sorbate (if qh 3> 1) or of the substrate (if qh <C 1). If both materials 
have identical elastic constants E and v, (|l56|) applies without restriction. 



The ca lcula tion might be done for a cubic crystal bounded by a (001) surface, using the corresponding elastic Green's 
functions ]i70]. 



Minimization of the total elastic energy 5T e i = 5J-\ + 8T2 with respect to e yields (assuming E a — E, which is 
sufficient for our qualitative analysis) 



qCi 5a 



and the total elastic energy 5F e i = 5T\ + 8J-2 is 



^, i = _^M^, EA 



4C* 2 V a 



(157) 



(158) 



Thus, the elastic free energy change resulting from the sinusoidal deformation is negative. 

However, there is another contribution to the free energy, which opposes the sinusoidal modulation. Indeed, this 
modulation increases the surface area, and therefore the surface energy or 'capillary' energy. The area increase is 
easily seen to be A8h 2 q 2 /4, and the capillary energy increment 5J- cap is obtained by multiplying by a coefficient a, 
called surface stiffness. Hence, 



SFcap = Aa6h 2 q 2 /4 



If the strain is small, its effect on a can be neglected and the total energy is the sum 



Tu 



Fc, 



J~ e> 



Thus, the total energy increment per unit area is obtained by adding (158) and (|159|), and equal to 



4G 2 V a 



(159) 



(160) 



(161) 



We conclude that, if a material is adsorbed on a substrate which has a different lattice parameter, its surface cannot 
be planar. The planar shape is unstable with respect to modulations of wavelength larger than 



A* 



2irC 2 a 
~CfE 



\SaJ 



(162) 



since (161) is negat ive i s that case . 

It follows from ( 161 ) and ( 162 ) that the free energy gain per unit area is proportional to {o~a/a) A for a given 
amplitude 8h. 

In the instability discussed in this paragraph, the effect of the substrate is merely to produce an anisotropic, external 
stress. Such a stress can also be produ ced mechanically. This is the situation addressed for instance by Nozie res [172|, 
Grilhe [173] and Kassner & Misbah [174] in their theoretical, nonlinear analysis, and by Thiol ct al [175] in their 
experiments on He. An instability arises in both cases, but the long time evolution is quite different. In a mechanically 
stressed homogeneous material, the amplitude of the surface modulation can be very large. In contrast, when the 
stress is due to a substrate, the modulation is not expected to penetrate into the substrate. The former situation will 
not be addressed in the following, while the c onseq uence of the presence of the substrate-adsorbate interface below 



the free surface will be discussed in paragraph IX E 



If, as we have just seen in t his paragraph, the initially planar surface is unstable with respect to modulations of all 
wavelengths longer than ( 162 ), the actual structure which will appear is not yet clear. For short times, one can expect 
that the wavelength which will dominate the actual structure will be the one which develops more rapidly than the 
other ones. Thus, the preceding thermodynamic study is not sufficient and a kinetic treatment is necessary. 



C. The Asaro-Tiller-Grinfeld instability: kinetics 



1. Classical theory near equilibrium 



Without calculation, we expect that fluctuations of longer wavelength require a longer time to form, since the 
atoms should co me f rom longer distances to form the mo dulation . A more precise study of the evolution in time of 
the modulation ( |148| ) was initially done by Spencer et al. 176,177] and is reproduced below. 

The translational invariance of the problem implies that, at short time, a small initial sinusoidal deformation remains 
sinusoidal, namely 



z(x, y, t) = (z) + Sh(t) cos(qx) 



(163) 



Insertion of (|jj) into fll63j ) yields, neglecting the noise, 



— — coa(qx) = —dj(x,t)/dx 



(164) 



where the surface current density j (x, t) is usually assumed [176| to be related to the chemical potential /i by a linear 
relation as explained in paragraph [V D 2 , 



j(x,t) — —Td/j,(x,t)/dx 



Relations (164) and ( |165| ) imply that the chemical potential /i has the form 

n(x,t) — jUi (t ) cos(qx) 



(165) 



(166) 



where the function fii(t) is related to 6h(t) by the following relation which follows from formulae (164) to 0169 ) 

dSh 



dt 



= -TgVi(t) 



(167) 



On the other hand, /i(cc, t) is the free energy per atom, so that the free energy variation per unit time is, if a 2 c is 
the volume per atom 



f = (a*cyi J 'dxdy^tf-^ = (a 2 ^^ 



dt 



2 ! """ViW / ^dycos 2 (gx) = i(a 2 c)-U^ (<) 
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(168) 



Now, dT/dt can be related to the free energy calculated in paragraph IX B for a non-singular surface. In order 
to extend the theory to singular surfaces, we shall assume, following textbooks [178] that, as in any system, the 
free energy of weak fluctuations is quadratic with respect to the variables which describe the system, which are here 
the height variations. Because of translational invariance, it is appropriate to use the Fourier components Zk, which 
diagonalize the free energy increment 



5^ = Y,B{k)\z k \ 



(169) 



If (163) holds, only the components z q and z_ g are appreciable and 

5T{t) = B(q)ASh 2 (t) 

Taking the time derivative of ( |170D and comparing with ([168]), one finds 

Ml (i) =4a 2 cB(q)5h(t) 



(170) 



(171) 



For a rough (i.e. non-singular) surface, B(q) is obtained if one identifies (170) with (161). This yields B{q) 
-B\\q\ + B 2 q 2 , where 



B 1 



C 2 (8a 
4Ch 



E and B 2 = a/A 



(172) 



We have written \q\ instead of q in order to include the case q= {q x ,qy) with \q\ = q 2 + q 2 

An appropriate treatment of a singular surface requires renormalization group methods flqfl. If one wishes to 
preserve simplicity at the expense of rigor, one can introduce into ( 169 ) a positive coefficient Bq of order 0. 



B{q)=B a -B l \q\+B 2 q 2 



(173) 



This heuristic form does not satisfy invariance under the translation z — > z + c, but correctly describes the 
stability with respect to deformations of any wavelength if T is sufficiently lower than Tr. Moreover, the form (|173|) 



is justified in the p resence of a substrate since translational invariance is suppressed. We shall come back to this point 
in paragraph IX E . For a n infi nite adsorbate thickness, the coefficient Bo vanishes at and above Tr.£^ 
Insertion of (171) into (167) yields 



ddh(t) 

dt 



-ATa 2 cB{q)q 2 8h{t) 



the solution of which is 



with 



Sh(t) = 5h(0) exp(w g £) 



-4a 2 cTB{q)q 2 = -ATa 2 c{B - B x \q\ + B 2 q 2 )q 2 



(174) 



(175) 



(176) 



The condition of stability of a plane surf ace is uj q < for any q. For a non-singular surface, Bq — 0, Lu q i s alw ays 



positive for small q < 2tt/X*, given by (162), and the plane surf ace is unstable in agreement with paragraph IX B 

Moreover, as expected, the absolute value of expression (176) decreases rapidly with q for q < 2ir/\* , so that long 
wavelength components z q (t) develop very slowly. The components which will actually appear are those of wavelength 
slightly larger than A*. Those which are shorter are stable, those which are much larger are too slow. 

Even in the case of a singular surface, there may be a window A* < A < A**, in which ( |176[ ) is positive and the 
surface linearly unstable (i.e. without an activation barrier). In the absence of growth, this can probably occur only 
very near Tr. It will be argued in the next chapter that MBE growth conditions can alter this conclusion. 

Before doing that, it is of interest to give more details on the instability. The above formulae tell us how an initial 
perturbation evolves, but say nothing about the initial perturbation. In a somewhat heuristic approach a la Langevin, 
one can assume that it is produced randomly by some 'noise' and write an equation analogous to (pn. Eliminating 
the current as above and performing a Fourier transformation, one obtains 



dzq 
dt 



UJ q Z q +<p q (t) 



where the 'noise' <p q (t) satisfies the following relations analogous to (12) 



(¥>,(«)> = , (<p q (t)<p* q ,(t'))=<pl8(t-t')5 qg , 



In the absence of noise, ip — 0, (|177|) yields (|175|) . In the presence of noise, integration of (|177j ) yields 

ft 



5^ (t - f 'WO^' 



and use of (178) gives 



(z q (t)z*,(t)) =5 q 



which describes the surface morphology at time t. 
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(177) 



(178) 



(179) 



(180) 



2. High symmetry surfaces and MBE growth 

The previous calculation applies near equilibrium. It is therefore not clear that it can be applied to a growing 
crystal. 

The theory of the previous paragraphs applies to non-singular surfaces near equilibrium. Instabilities are generally 
observed during growth, e.g. MBE, and the surfaces are often believed to be singular. In this paragraph, attention 
will be focussed on high symmetry orientations, (001) or (111). 



Formula (173) is only acceptable for fluctuations near equilibrium. The energy of a bump of thickness equal or larger than 
the atomic distance on a singular surface is given by a quite different formula as will be seen in the next chapter. 



Such surfaces are quite different in the equilibrium state and during growth. The most obvious difference is the 
existence of long, closed steps which bound large terraces or valleys which appear and die at the frequency Fqo 2 of the 
RHEED oscillations. Another difference is that the average adatom density can be much larger than at equilibrium, 
and this can increase the absolute value of the average current, and even change its sign, as will be seen. 

The effect of short-lived terraces is difficult to analyze but canpresumably favour an instability since these terraces 
create a kind of roughness, which actually increases with time 0] and evolves toward true roughness, with diverging 
height fluctuations. 

The remainder of this paragraph is devoted to the effect of the current due to freshly deposited adatoms from the 
beam, which have not yet been incorporated by a step. The surface density pi nc {x 1 y, t) of these adatoms can be much 
larger than the equilibrium adatom density po(T), in particular in the growth of an element (e.g. Si or Ge or a metal) 
when po(T) is given by (j|). In the case of metals, values of the adatom energy Wq are given by Stoltze |4(J. The 
density Pi nc (x,y,t) in MBE is discus sed in A pp. 0. At low temperature, it depends only on the ratio Fq/D of the 
flux rate to the diffusion constant |1 179 18C ] and is much larger than po (T) ■ This inequality is not so clear at the 
temperatures used in MBE growth. In the case of III-V semiconductors, it has been argued pq | that the equilibrium 
adatom density, given by (Q), can be large, of the same order as the adatom density during growth. 

In this paragraph, po will be assumed to be much smaller than pi nc , and the consequences of this situation will be 
investigated. The current jinc(x,y,t) of freshly deposited adatoms on the surface contains a part jf^ c (x,y,t) which 
results from elasticity, and a part JiJ r {x, y, t) which results from other factors, in particular terrace nucleation. The 
l atte r has been discussed in chapter [IV], where its main effect was found to be a modification of the coefficient Bi in 
(173). In the absence of Ehrlich-Schwoebel effect, B2 is increased with respect to its equilibrium value. The Ehrlich- 
Schwoebel effect is generally weak at usual growth temperatures, at least in the case of semiconductors, and will be 
neglected in this chapter and in the following ones. The part which results from elasticity can be roughly written as 
proportional to the average pi nc of the incoming adatom density.p 5 | 



jf nc {x,t) = ~(3Dp inc dV e i(x,t)/dx 
where V e i is the elastic potential energy of an adatom. It has to be proportional to the strain, 

V e i{x,t) = -K x e{x,t) 
and the strain associated to the modulation (1163) has the form 



(181) 



(182) 



e(x, t) = e(t) cos(qx) (183) 

where e(t) is related to the amplitude Sh(t) by ( |157| ). 

Now, there is a surprise [181|. The coefficient K\ can be positive or negative, i ndep endently of the sign of Sa/a\ 
The reason is the following. The coupling of adatoms to the strain, expressed by (181), is partly due to the size of 
the atom, but also has chemical reasons which, a priori, are hard to predict. Only the part of K\ which results from 
size effects h as n ece ssar ily the sign of 8a/ a. 

Formulae fll8l] ), (p^ ) and (^8^) yield 



JiL( x >t) = -pDKip inc qe(t)sin(qx) 



or, replacing e(t) by its expression (157) 



d Sa 



3t n Ax,t) = ~/3DK 1 p mc q 2 ^- — Sh(t)sm(qx) 



2C 2 a 



(184) 



(185) 



When inserted into (164), this expression yields a contribution to (176) proportional to q 3 , and therefore replaces 
Bi by B[ given by 



25 In reality, pi nc (x,y,t) is strongly modulated and vanishes in the neighbourhood of a step. The elastic potential is also 
modulated. In a correct treatment, the current should be proportional to the average value of pinc{x,t)V e i(x,t) on a distance 
large with respect to the distance between st eps. Here, we approximate the average value of this product by the product of the 
average values. A more accurate treatment |18l| has been done in the case of a vicinal surface and leads to the prediction of 
step bunching if the coefficient K\ of formula (182) has the sign of 8a/ a. In the c ase of a high symmetry surface, step bunching 
is likely to appear too, thus leading to higher harmonics to be added to (163). This effect does not show up in the linear 
stability analysis. 



B' 1 /B 1 



Pine K\ a 

po 2a 2 cEC'i 8a 



(186) 



If KiSa/a < 0, the instability can disappear if pine/ Po is large en ough . If KiSa/a > 0, the instability should de velo p 
more rapidly during growth than in the absence of flux. Formula (162) which gives A* should be multiplied by (186), 
so that the wavelength of the surface modulation can be smaller than in the absence of growth. 

Another possible effect of MBE growth is a high concentration of surface vacancies instead of surface atoms. The 
extension of the previous argument to this case would be straightforward. 



D. Experiments 



As will b e seen , i nstab ilities arc observed in hctcrocpitaxial growth, and can be explained by the mechanism of 
paragraphs [XA to IXC. However, a precise comparison is not possible, in particular because thermodynamical 
surface properties (Tr, a ...) are generally not well known 



The instability of coherent, epitaxial films has been observed for instance in the growth of Ge on Si(lll) [182|, 
and of Gei-xSiz on Si(001) [183-L85J. The observed features depend on the Germanium concentration, presumably 
because the misfit does. The misfit of pure Ge on Si is 0.0417. The non-stoichiometric composition allows to obtain 
a tunab le m isfit, but makes com pari son with theory difficult since the concentration can become inhomogeneous for 
kinetic [186] or thermodynamic [187] reasons. 

For small misfits (low Ge concentration), it is possible to grow r athe r thick layers, and at a 'criti cal' t hicknessp| a 
three-dimensional modulation appears, which is 'quasi-periodical' |184| and reminiscent of formula (148). 

For large misfits, the adsorbate splits into clusters and thi s ha ppens at a much lower cri tical thickness. For instance, 
this thickness i s ab out 100 nm for Geo.83Sio.17 on Si(001) [184], and only 3 monolayers |188[ for pure Ge on Si(001) 



and 2 bilayers |182j| for Ge on Si(lll), where the misfit is about 0.04. In the case of InAs on GaAs(OOl), t he m isfit is 
even larger (0.07) and the adsorbate splits into clusters at an even lower thickness, about 1.6 monolayers [189]. 
Berbezier et al. [185] have found that the amplitude of the modulation increases proportionally to {8a/ a) 2 . According 



to previous paragraphs, this amplitude, at a given time, should be given by ( J180| ), where cj q is given by ( |176| ). From 
these formulae, the amplitude would be expected to increase more rapidly than (Sa/a) 2 for long times. 

The case Gei-^Si^ on Si corresponds to 8a/ a > , i.e. the adsorbate is compressed by the substrate. A modulation 
has also been observed (Fig. B21) for Sa/a < [154|. 

The case of large misfits will be addressed in the next chapter, but the splitting into clusters is in agreement with 
the argument of paragraph IX A. The cluster size is just limited by diffusion. 

In the case of small misfits, the roughly sinusoidal modulation which has been observed is presumably a transient 
state, which apparently does not evolve because the dynamics is too slow. S epara tion into clusters would still lower 
the free energy. The qualitative agreement between the theory of paragraph IX B , which predicts an instability, and 
the experiments, which provide an observation of this instability, does not imply that everything is well understood. 
In particular, the theory was made for a non-singular surface, i.e. a rough surface in the sense of chapter |. However, 
a (001) surface is generally believed to be singular at the temperature at which the crystal is grown. The complete 
solution of this puzzle would require a quantitative theory which is not yet available, but the following elements may 
contribute to the explanation. 

i) The surface may contain steps or other defects to be thermodynamically rough. 

ii) Even if the surface of the 'free' material is smooth, the surface of the constrained material may be rough or 
closer to roughness. It is especially interesting that waves appear in tensile films (8a /a < 0) at a lower misfit than 
in compressive films. An increase of the atomic distance clearly favours surface melting (as it favours ordinary, bulk 
melting) and therefore surfa ce roug hness (since a liquid surface is rough). 

iii) As seen in paragraph [X C 2 , a growing surface can be linearly unstable even if a surface which has a similar 
morphology, but is at equilibrium, is stable. 

In any case, it is not surprising that separated clusters have not been observed for a moderate misfit. Indeed, theories 
based on an activated process predict a huge activation energy proportional to (8a/a)~ 4 , as seen from formula |198| ) 
below. On the other hand, if the pl ane surface is linearly unstable, the instability is established at a ra te pr oportional 
to A*~ 3 if the stabilizing terms of (176) are ignored, and A* is very large in weak misfit as seen from (162). 



This critical thickn ess fo r the 2d-3d transition is generally different from the critical thickness for misfit dislocation formation, 
introduced in chapter VIE. 



One can wonder whether the existence of a 'critical' thickness below which the plane surface is apparently stable is a 
thermodynamic or kinetic property. In other words, is a three monolayer thick deposit of Ge on Si thermodynamically 
stable or does it only lack time to deform? This point is not yet cl arifie d. The fact that the 'critical' thickness can 
be considerably increased by surfactants, as will be seen in chapter XIII, suggests that kinetic effects are important. 
The same conclusion is suggested by the argument of paragraph IX A , according to which cluster formation always 
lowers the energy. However, it will be argued in the next paragraph that the stability of thin layers can also have 
thcrmodynamical grounds. 



E. Effect of the adsorbate thickness 



If the stress is produ ced by a substrate, the previous treatment is only reliable if the adsorbate is sufficiently thick, 
as noticed by Grinfeld [ 190, 191 1 . A finite thickness of the adsorbate has in particular the following effects. 

i) The elastic consta nts are u sually not the same in the adsorbate and in the substrate. This introduces compu- 
tational complications |177 192], apart from simple particular cases, e.g. if the adsorbate is very thin. This case is 
treated in paragraph XD below in t he case T < Tr, and the case T > Tr might be treated as well. This effect 
modifies the instability threshold A* [177,192] but does not introduce a critical thicknessQ except in the unphysical 
case of an infinitely hard, undeformable substrate. 

ii) A gr eater thickness implies a longer growth time 



paragraph IXC 



and the instability is enhanced by growth as argued in 



hi) Another effect of a finite adsorbate thickness (z) is the limitation of height fluctuations in the direction Sz < 0, 
where Sz = z — {z). Indeed a local fluctuation always costs capillary energy, but if Sz < —{z), it does not bring any 
elastic energy. It is thus tempting to assume that such a fluctuation is completely forbidden, i.e. the condition 



z> 



(187) 



is satisfied on the whole surface. This is a reasonable approximation if the substrate (but not necessarily the adsorbate) 
is below its roughening temperature. For instance, if the adsorbate becomes thinner than 1 monolayer, the height 
is in most plac es equal to or 1, as the spins of an Ising model. Now, the free energy of an Ising paramagnet has 
the form ( |169| ) (where z would be replaced by the magnetization M in a magnet) with a non-vanishing coefficient 
-Bo- More generally, a finite adsorbate thickness introduces a non- vanishing coefficient Bq, which is now allowed by 
symmetry, even above Tr. Thus, a small adsorbat e thi ckness makes the instability more difficult. Moreover, if Sz is 
equal to or 1, a continuous variation of the form (148) is hardly acceptable and the instability should have different 
features. 

These special features are well illustrated by recent experiments by Ponchet et al. [194,195]. Layers of InAs of 
different thickness were deposited on InP(OOl) by MBE during a short time at about 500°C, then maintained in 
an As flux (but without In flux) at the same temperature during 30 seconds, then buried under InP. The result is 
the following. Deposits thicker than 2 monolayers^ give rise to a fairly regular array of small clusters. Deposits of 
thickness comprised between 1.5 and 2 monolayers give rise to an irregular array of clusters whose volume is about 5 
times as large as when resulting from a thick deposit. 

A possible interpretation of the experiment is the following. For thick deposits, the calculation of paragraph IX B 
will be assumed to be valid. Then a modulation of well-defined wavelength A appears -more precisely, two modulations 
in two orthogonal directions. The amplitude 5h increases with time, and when it becomes of the order of the adsorbate 
thickness, the substrate has to be taken into account and the adsorbed film splits into clusters. The shape of these 
clusters is the subject of the next chapter. The wavelength A, as seen above, is of the order of magnitude of A*. 

For thin deposits, the substrate cannot be ignored even at the beginning of the instability. Since there is a wetting 
layer, its effect is summarized by a condition stronger than (187), namely z > c, wh ere c is the thickness of a monolayer. 



IX B 



For thin deposits, but thicker than 1 monolayer, the theory of paragraph 
does not appear in the linearized treatment. The system is not 'linearly unstable'. 



has to be modified. The instability 
However, it is not stable either 
since, as argued in paragraph IX A 2, the formation of big clusters does lower the energy. Thus, thin adsorbates are 
not stable, but metastable. They can only reach their equilibrium state if they overcome an activation barrier. This 



27 However, a t hickness threshold, below which the instability is not observable in practice, has been been introduced by 



177,192] 



various authors 

In the case of a binary semiconductor, a (001) monolayer designates a set of an atomic layer of metal and an atomic layer 
of metalloid. 



activated process may be expected to give rise to a rather disordered array of clusters, as observed experimentally. 
Indeed, it is a localized effect rather than a cooperative one. The nucleation barrier can be overcome at some place 
and not elsewhere, while the linear instability described in paragraph KB corresponds to the increase of a density 
wave in the whole surface. 

Another point which has to be understood is that clusters which arise from thin deposits are bigger than those 
which result from thick deposits. This may seem paradoxical since, for the same cluster size, the atoms has to diffuse 
on a longer distance in the case of a thinner film. The following arguments can explain the experimental facts. 

1/3 

i) A theoretical value of the typi cal linear size V c of clusters arising from activated processes will be evaluated in 
the next chapter (see formula (197) below). It is the product of (5a/a)~ 2 by the ratio of a surface tension divided by 

an elastic constant, just as wavelength A* given by ( |l62| ). This suggests that both lengths V c and A* have the same 
order of magnitude. However, the volume of the clusters should be, in the activated case, essentially the cube of the 
linear size, i.e. proportional to (6a/a)~ 6 , while in the linearly unstable case, it should be proportional to (z) times 
the square of the linear size, i.e. proportional to (z)(5a/a)~ i as seen above. In other words, in the case of nucleation, 
the volume is that of the critical cluster. If there is a smooth modulation of the surface, the wavelength is the critical 
wavelength. 

ii) The following argument is slightly different, but related. In the Asaro-Tiller-Grinfeld scenario, assumed to be 
valid for thick films, the instability is collective, all mounds appear at the same time and cannot go fishing atoms 
outside their own domain which has a well-defined radius A*. In contrast, if clusters are randomly nucleated, the 
nucleation of new clusters in their vicinity does not occur at once, and the first created clusters have some time to 
sweep remote atoms atoms. 

iii) In the case of thick deposits, the instability can begin during the growth, and the wavelength can be smaller as 



seen in paragraph IX C 2 



The above interpretation of the experiments of Ponchet et al. is only tentative and can be subject to controversy. 
Indeed, the average cluster size has been supposed to be selected at the onset of the instability in a two-dimensional 
layer, which then splits into 'islands' or clusters which have this size. In reality, the evolution after cluster separation 
is not negligible 158, 164, 196- 198 1 as will be seen in chapter XI. A support of our view may be seen in the observation 
that deposits of InGaAs on GaAs or AlGaA s give more regul ar st ructures for (311)B surfaces^] than for the (100) 
orientation, in the case of MOVPE [155, 156 as well as MBE [157|. Indeed, the rougheni ng tra nsition temperature 
is expected to be lower for a (311) face than for (100), so that the argument of paragraph IXB is more likely to be 
applicable. 



F. Solid-solid interfaces and other generalizations 



A case of interest is that of coherent, epitaxial solid-solid interfaces. Such interfaces ar e commo nly encountered 
in semiconductor technology (quantum wells, multilayers). One can for instance consider [199 200 a quantum well 
constituted by a thin slab of a material A in a infinite crystal of a material S. The elastic properties are simpler in 
this case then a multilayer or a single interface between two solids because the lattice constant is that imposed by the 
material S. 

Using the same arguments as for a free surface, one can show that a coherent interface is not planar at equilibrium 
above its roughening transition, which can be defined as that of a free surface. The plan ar shape is linearly unstable 
with respect to small deformations, just as the surface of a coherent adsorbate [ 199| , 200| |. 

If one goes beyond the linear stability analysis, the two cases are different. Large modulations of a solid-solid 
interface are not so favourable as those of a free surface. 

On the other hand, volume dynamics is very slow and solid-solid interfaces can remain planar during a long time. 
We do not need to worry about our TV, its transistors will not be damaged by interdiffusion. 

We conclude this chapter by a hint on possible generalizations of the Asaro-Tiller-Grinfeld instability. In (158), Sh 
is the amplitude of a height modulation. As a matter of fact, it might be the amplitude of the modulation of any 
continuous 'field', for instance the surface impurity density. If this field is l inear ly coupled to the elastic strain, then 



a modulation of this field lowers the elastic e nerg y as shown in paragraph IXB . However, this does not give rise to 
an instability if the coefficient Bq of formula (|173D is strongly positive. 



29 In the (311)B GaAs surface, the steps are As, while they are Ga in the (311)A surface. The (311)A surface is quite different 
and will not be considered here. 



X. THERMODYNAMICS OF AN EPITAXIAL, COHERENT CLUSTER 

A. Energy of a cluster 

In the previous chapter we have seen that a plane adsorbate surface is unstable if adsorption is coherent. The 
final stage of this instability is an array of clusters. If the cluster size has a narrow distribution, these can be used 
as 'quantum dots' in microelectronics after being 'buried' into the material which constitutes the substrate. These 
clusters are the subject of the present chapter. In contrast with the previous chapter, we now study the final or nearly 
final result of the instability, rather than its onset. In the first few paragraphs of this chapter, simple theoretical 
considerations will be made, which then will be compared with exper imen tal observations. 



In contrast with the three-dimensional clusters addressed in chapter VII , those which result from coherent, epita xial 
growth often form a fairly regular array and have a narrow distribution of sizes |155 15£ 18£ .194,195,197 201 J202], so 



that they can be used as quantum dots. This phenomenon is called self- organization. Its mechanism is not well 
understood, but a few conjectures will be presented in the next chapter. We shall first consider a single cluster and 
determine its equilibrium shape. The standard way to do this is to minimize its energy. 



In the linear stability analysis of chapter IX , the energy was supposed to be the sum of an elastic part resulting from 
the elastic strain in the whole sample, and a surface (or 'capillary') term independent of the strain. This assumption 
is in principle not correct for strongly deformed surfaces, and the surface energy does depend on the strain through 
the 'surface stress'. The surface stress can be viewed either as a strain-dependent part of the surface energy, or as an 
clastic stress which is concentrated on the surface and should be added to the part which varies continuously in the 
solid and has only discontinuities at interfaces. In other words, the surface stress is a delta- function singularity of the 



elastic stress at the surface. As will be seen in paragraph XI F, certain experimental properties might be explained 



by taking the surface stress into account, as was done for instance by Shchukin et al. [203| and Muller & Kern [204 1. 
For the sake of simplicity, the surface stress will generally be ignored in the present chapter. This is correct in certain 
limiting cases, e.g. i) for weak strains, which do not affect very much the surface energy, ii) For big clusters, where 
the elastic energy is not much affected by the surface. 



While in chapter IX the temperature was assumed to be higher than the transition temperature Tr which corre- 



sponds to the average surface orientation, in the present chapter the condition T < Tr will be assumed to hold. 

B. Capillary energy of a cluster 

If a smooth film splits into clusters, this modifies the free energy density. As in the previous chapter, one can 
distinguish a 'capillary' energy, localized near the interfaces, and an elastic energy, which is not. The elastic energy 
is simpler because its variation in space obeys linear equations which only depend on the known elastic constants of 
the materials. The capillary energy is difficult to evaluate from first principles, and even from empirical potentials 
since it depends, for instance, on surface reconstruction, which is an essential feature of all semiconductors. We shall 
present here a qualitative view. 

The capillary energy of a cluster contains in principle two parts. One corresponds to the free surface, and the other 
to the adsorbate-substrate interface. The main effect of the latter is to produce, in many cases, as seen above, a 
wetting layer. The energy of the cluster-wetting layer interface will be ignored. 

The property T < Tr implies that the capillary energy increment 8F cav resulting from a modulation is no longer 



proportional to 5h 2 as it was in (159). Instead, it is proportional to Sh for small values of Sh. Indeed, the perturbed 
surface may be viewed as formed by flat parts separated by steps. The step free energy per unit length 7 is positive 
below Tr and the number of steps is proportional to Sh. 

Similarly, a cluster of height h whose basis has a linear size R has a capillary free energy equal to 

Fcap = Const x jhR/c 

where the constant is positive since a negative value would imply that the plane surface is unstable even in the absence 
of misfit. The constant depends on the details of the shape of the cluster. We shall assume that it is a truncated 
pyramid (Fig. |3g) with a square basis of side R, and that the sides make an angle 9 with the substrate surface. Then 
the detailed expression for small is 

Tca P = 4 7 (/i 2 /c) \^- cot 9 (188) 

1 — X 

where 



x = R'/R = 1 - — cot 9 (189) 

R 

is the ratio of t he s ides R and R' of the two square faces. 

Expression, ( |188|) is correct for | tan#| -C 1 a nd q ualitatively valid for | tan#| < 1. For larger values, the interaction 
between steps are not taken into account, but ( |188| ) still gives the right order of magnitude. 

In the following, it will be assumed that the cluster is a truncated pyramid. The ambition of a theoretical treatment 
is thus restricted to find the three parameters R, h, x which minimize the free energy. In the absence of elastic effects, it 
is possible to do much better and to find the equilibrium shape without any restriction by the Wulff construction P,H . 
The existence of an elastic energy makes things more difficult. The experimentally observed shape is not always a 
truncated pyramid. Fig. B7^ shows a more complicated shape which results from the fact that the surface of a 
semiconducto r ha s a rectangular rather than square symmetry. The cluster shape will be addressed in greater detail 



in paragraph XG. 



C. Scaling laws for the elastic free energy of a cluster 



The explicit form of the elastic Green's functions for a semi-infinite isotropic medium [171| or cubic crystal [17C] 
bounded by a planar, stress-free (001) surface is known. However, these Green's functions are not directly applicable 
in the pre sence of adsorbed clusters except if these clusters are very thin. This special case will be addressed in 



paragraph XD for an isotropic medium. 

In the general case, the only simple statements which can be made about the elastic free energy are scaling laws, 
which hold for any cluster shape. 

As in the previous chapter, the energy will be counted from a state where the substrate has its 'natural' lattice 
constant and where the adsorbate has the lattice constant of the free substrate. The energy of that state (set equal to 
by our choice of origin) is higher than that of the equilibrium state, since the elastic strain precisely tries to lower 
the elastic energy. Therefore, with our convention, the elastic energy at equilibrium is negative. The strain will also 
be counted, as in the previous chapter, from the above defined state where the substrate will be said to be 'totally 
unconstrained' and the adsorbate 'totally constrained'. 

A list of scaling laws, which hold within linear elasticity theory and ignoring surface stress, will first be given. 

A. The elastic free energy of a coherent, epitaxial cluster of a given shape and size is proportional to the square 
(8a/ a) 2 of the misfit. 

B. The elastic free energy of a coherent, epitaxial cluster of a given shape with a given misfit is proportional to the 
cluster volume. 

In other words, if all linear dimensions are multiplied by A the elastic free energy is multiplied by A 3 . The shape 
may be a pyramid, a truncated pyramid, a half-sphere, etc. 

C. The strain created by a coherent, epitaxial cluster of a given shape at a long, given distance f is proportional to 
the cluster volume V. 

D. The strain created by a coherent, epitaxial cluster of given shape and size at a long distance f is proportional 
to 1/r 3 for any given direction f/r. 

The same law holds for an isolated adatom. 

E. The elastic interaction free energy between two coherent, epitaxial clusters of given shapes at distance f is 
repulsive and proportional to 1/r 3 for large r. 

V el {r) = B/r 3 (190) 

The same law holds for two isolated adatoms. 

These scaling laws are derived in App. FEj. Property (A) is a straightforward consequence of the linear and continuous 
nature of elasticity theory, which as a matter of fact is an approximation. The other scaling laws, especially (D) and 
(E), are well-known for an isotropic elastic medium. The quantitat ive f ormulae which give the response of an isotropic 



solid to superficial forces hav e been obtained by J.V. Boussinesq [205] at the end of the nineteenth century and can 
be found in textbooks p,171|. 



It is of interest to compare the elastic interaction between adatoms and the phonon-mediated interaction between 
electrons in the BCS theory of superconductivity. Although electrons ore quantum particles, there is a strong analogy 
between their interaction and the elastic interaction between two point impurities in a bulk solid. However, the 



interaction between electrons is generally assumed to be short ranged, in strong contrast with (190). As a matter of 
fact, the elastic strain created by a point impurity in a bulk solid is proportional to 1/r 3 as in the surface problem ... 
but the term in 1/r 3 of the interaction energy between two point impurities turns out to vanish n. 



D. Case of a thin cluster: Tersoff approximation 



We can say a little more if the adsorbed cluster is sufficiently thin. For the pyramid of Fig. |3g, this means that 
h <C R or/ and tan fl <C 1. In this case, an approximate calculation of the elastic energy, often used by Tersoff and his 
coworkers [ 209 ,207] and hereafter called 'Tersoff approximation' is possible. It relies on two simplifications. 

i) In (|149|), the strain is assumed independent on z, so that 



bT\ = -Const x — / dxdy[e xx (x,y) + e yy (x,y)}z(x,y) 



(191) 



where the integral is on the substrate-adsorbate interface and z is the height of the adsorbate surface with respect 
to this interface. Thus, the integral is over the substrate-adsorbate interface, which is assumed to be a plane. If the 
adsorbate is an isotropic solid of Young modulus E a and Poisson ratio v a , th e con stant is equal to (5a/a)E a /(l — v a ). 

ii) Since the adsorbate is thin, its contribution to the term T2 defined by ( |l52|) is negligible. 

With these approximations, the problem is reduced to finding the response of a sem i-infi nite crystal limited by a 
plane (the substrate) to known forces acting on the surface, which are fully defined by (191). 

In the case of a truncated pyramid with h <C R, the strain is proportional to h for fixed R and 9, so that T<z is 
proportional to h 2 . The linear part T\ is also proportional to h 2 since z is prop ortio nal to h in (191). Thus, the total 
elastic energy is proportional to h 2 . On the other hand, according to paragraph XC, the energy is proportional to R 3 



for fixed h/R and 9. We conc lude that the elastic energy of a low pyramid is proportional to h 2 R, in agreement with 
the quantitative formula ( 192 ) below, valid for a coherent epitaxial cluster of a pyramidal form in the limit tan# <C 1 
in the isotropic limit (en — C12 = 2C44). The (tedious) derivation of this formula essentially relies on the fact that, 
in the limit tan 9 <C 1, one can use the elastic Green's function (i.e. the elastic response to a localized stress) of 
a solid bounded by a plane surface, which is known. The Green's function method yields the energy as a multiple 
integral which can be calculated for an isotropic solid. If both R and h are much larger than the atomic distance, one 
obtains pOgLpg 
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where E s and E a , v s and v a are the Young modulus and the Poisson ratio of the substrate and the adsorbate 
respectively, V is the volume of the cluster, x is defined by ( |189| ) and G{x) is a function which decreases smoothly 
from a value close to 0.5 (for x — 0, i.e. a full pyramid) to reach the value with a finite slope for x = 1, i.e. a slab 
(Fig. [38|). For a given shape, x is fixed and T e i is proportional to tan 9 which is itself proportional to h 2 R as stated 
above. 

The analytic form of G(x) is impressive, namely 
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Numerical calculations [21C] are in qualitative agreement with (|l92j) in the case of interest, i.e. h -C R and tan 9 <C 1. 

The Tersoff approximation makes calculations much easier, but reveals consistency problems. Consider a coherent, 
epitaxial cluster which is infinitely long in the y direction, and whose section by any plane parallel to xOz is a triangle 
of thickness 2R and height h, so that the height h(x, y) = h(x) is h(x) = h — h\x\/R if |x| < R, otherwise h(x) = 0. 
The strain on the surface of the substrate, at a distance a of the cluster edge, has the form 
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where the respon se fu nction l/(a; + R + a) 2 results from the scaling law (D) of paragraph XC, after integration on 
y. For large R, (195) diverges as ln(i?/a). A similar divergence would arise in the case of a pyramidal cluster as 



well. This divergence is unphysical, since a strain cannot diverge. It may be an artifact of the Tersoff approximation. 
However, a failure of linear elasticity theory is not excluded, although more elaborate versions of linear elasticity have 
apparently been applied successfully to this problem by Freund et al. [163, 211] and by Shchukin et al. [203]. The 
Tersoff approximation is erroneous in at least two respects: i) The stress in the z direction is not correctly taken 
into account, ii) When replacing the cluster by a surface density of external stress, one has to take into account the 
fact that the resistance of the cluster to deformation is proportional to its thickness, so that surface elastic constants, 
proportional to the cluster thickness, should be introduced. Since the logarithmic divergence is weak, and the results 
are compatible with the scaling laws, the Tersoff approximation will be assumed to give the right order of magnitude. 



E. Equilibrium shape of a coherent cluster: 2d-3d transition 



When the energy (160) is known, it can be minimized at a hxed volume V with respect to 9 and x, and this yields 
the equilibrium shape. 

For small volumes V, the elastic energy (proportional to V) is small and the capillary energy is mainly due to the 
sides of the pyramid. The area of these sides should therefore be as small as possible. T his a rea is proportional to 
y/Vh multiplied by a coefficient of order unity which depends on the shape (see formula ( |188| )). It follows that the 
cluster should be as thin as possible, i.e. it should reduce to a single monolayer. 

For larger values, the elastic energy dominates the capillary energy. It is given by ( |192| ) with an accep table 
approximation for tan 9 < 1. For larger values, other approximations can be used, and it can be seen [ 208 | that 
the elastic energy is minimized if the cluster is three-dimensional, with h s=s R and tan 9 wl. As a matter of fact, 
experiments are consistent with a smaller value, tan9 w 1/4 to 1/2. 

We conclude that there is a '2d-3d' (two-dimensional to three-dimensional) transition when the volume increases. 
An analysis based on the formulae of the previous chapter shows that this transition is discontinuous. Clusters 
initially have the shape of platelets, then abruptly change their shape and become pyramids with tan 9 rj 1/4. The 
transition takes place for a volume V c which will be now roughly evaluated. A detailed calculation will be found 
elsewhere |20(|f20§]. 

The energy of a cluster is the sum of ( |l88|) and (|l92| ) or, dropping constants which depend of the sample shape, 
but are generally of order unity, 
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This expression is positive for small volumes V , reaches a maximum when V increases, and is negative for a linear 
size larger than 
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The maximum of the energy is obtained for a value of order V ac « 2V^/3 and the corresponding energy is the 
activation energy 
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Thus, the theory predicts a first order transition ('2d-3d transition') from flat, one monolayer thick clusters to 
three-dimensional clusters when the volume increases. The transition occurs for a volume close to V c . 

As a matter of fact, the 2d-3d transition had been observed experimentally before any theoretical prediction, as 



will be seen in paragraph X G 



In spite of its success in 'predicting' the 2d-3d transition, formula ( |196| ) is only qualitatively correct. The elastic 
part (192), in particular, must be modified in the case of clusters of thickness one monolay er. The correct formula 
contains a logarithmic correction, and the energy of such a platelet turns out to be [200.2071 
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where c is the monolayer thickness, a is such that a 2 c is the volume of the unit cell, and 
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In 7] = 2 In 2 - 2 + V2 - ln( V2 + 1) = -0.081 = In 0.922 



(200) 



The loga rithm in the elastic part (second term) of expression (19S) violates the scaling law T e i ~ V of para- 



graph XC 



B0I 



Relation ( |l99| ) has been obtained by applying elasticity theory to an object of atomic thickness. This is obviously 
not qu anti tatively correct. An idea of the correction to be made is obtained by taking into account the surface 



stress [204]. 



It is of interest to use (19£) to obtain an order of magnitude of the elastic energy per atom. Typical values of 
the Young modulus are E s c^ — 12 eV for GaAs and E a (? a — 9.3 eV for InAs. The energy per atom is proportional 
to In (R/c)/(R/c), with a multiplying factor of the order of 0.01 eV for InAs/GaAs, proportional to the square of 
the misfit. Elastic forces are really weak at the atomic level because the misfit cannot be very large. But they are 
important for a big enough cluster because of their long range character. 



F. More ambitious treatments 



In this review we addressed an isotropic material and approximated the energy by formula (160), ignoring the effect 
of the strain on t he su rface energy, i.e., the surface stress |g|,Q. 

Shchukin et al. 1 203] have given expressions which are far more ambitious in three respects. Firstly, they are valid in 
the physical ca se of a cubic crystal. Secondly they take into account the surface stress (which has also been considered 
by Moll et al. |212|). Thirdly, they are not restricted to thin adsorbates. However, the formulae of Shchukin et al. 
contain the elastic functions of a single material and therefore hold only if the adsorbate has the same elastic constants 
as the substrate. 

The complications with respect to (192) in the work of Shchukin et al. are moderate. The surface st ress j ust 
introduces two additional terms, which are negligible if the cluster size is large enough. Shchukin et al. 203 do 



not give explicit expressions of the various terms, but write useful scaling laws in terms of functions which can be 
calculated numerically by means of numerical integration of the equations of elasticity (the so-called finite clement 
method) . 



Using this method, Freund et al. [211 and Johnson & Freund [163] demonstrated a spectacular deviation from the 
predictions of the Tersoff approximation for thick clusters. They find that the strain perpendicular to the substrate 
surface changes its sign beyond a height of about 0.22 x2R, where 2R is the basis diameter. Usually, cluster formation 
or other instabilities of the Asaro-Tillcr-Grinfeld type are observed when the substrate is bigger than the adsorbate 
(i.e. 5a > 0), otherwise other types of instabilities generally occur, e.g. dislocations or twins. Then, near the interface, 
the adsorbate is obviously compressed by the substrate in the x and y directions. However, at a height larger than 0.22 
a, it is stretched! An intuitive explanation for this surprising behaviour may be as follows. The compression in the x 
and y directions is accompanied by an extension in the z direction, because the Poisson ratio is positive in all materials. 
Thus, the lower layers of a cluster push the upper layers upward. This can produce a stretching in the perpendicular 
direction, as happens if you press the center of a rubber leaf. Freund et al. were able to find an analytic version 
of their numerical results in a much older article by Ling [213] who himself used previous analytic theories [214,215] 
made during the second world war. Elasticity is indeed a very old science, and an accurate bibliography requires a 
very good library! 

A quantitative treatment of the capillary energy is eve n mo re difficult than for the elastic energy. On e can represent 
the interaction between atoms by an empirical potential 216] or make a first principle calculation 217 1. In both cases 
many reconstructed structures are a priori possible, and are a source of serious difficulties, since it is in principle 
necessary to compare the energies of all of them, to take into account the possibility of transitions from a structure 
to another, etc. Moreover, the reconstructed structures are experimentally known only for simple surfaces, and not 
for all the orientations which appear, for instance on the cluster sides. 



In short, the logarithm arises from the following fact. Any two atoms of the adsorbate at points f and r produce a strain 
in the substra te, which in turn produces an effective, elastic interaction proportional to l/|r — f\ A between the two atoms 
according to ( 19C ). The calculation of the energy of a platel et re quires 4 integrations, namely on the two components of r 
and r . Three of these 4 integration s yie ld the logarithm of ( |l99|) , while the last integration yields the factor R before this 
logarithm. It is noteworthy that, in (192), the logarithm has been cancelled by further integration on the vertical coordinate. 



G. Experiments 

An essential prediction of the simple theory presented in this chapter is the discontinuous 2d-3d transition. This 
is consistent with the experimental fact that, for a large enough misfit, three-dimensional clusters appear rather 
suddenly after the deposition of an appreciable mas s of mat erial, which, as expected, decreases with increasing misfit. 



For InAs/GaAs(001), it is about half a monolayer 1 196 197] above the wetting layer 



Another prediction of the simple theory is that big clusters are somewhat sharper than smaller ones. This is also 
in agreeme nt w ith observations. As a matter of fact, the first clusters which appear in the growth of Ge on Si(OOl) 
are 'huts' [|218| (Fig. p7]). These elongated islands are favoured by the symmetry of the (001) faces of the diamond 



lattice, but seem to be metastable p!9|. The stable shapes, according to Medeiros-Ribeiro et al. [219] are pyramids 



(square-based islands bounded, as huts, by {105} facets) and 'domes' ("structures with a large number of facets that 



look rounded at lower reso lution" ]21S| ]). Domes seem to be the stable shape of bigger clusters 219], and the fact that 
they are also steeper |219[ is in qualitative agreement with theory^] 

However, this agreement is only qualitative and the experimental situation seems more complicated than the simple 
theoretical description given above. First of all, the size distribution is 'bimodaP, i.e. it has two maxima which 
respectively correspond to pyramids and domes. Second, the sides of the pyramids have a low slope which would be 
expected to be unstable. 

Intermediate stages between platelets and and three-dimensional clusters are difficult to observe, in agreement with 
the fact that they should be excluded in a discontinuous transition. An indirect investigation is however possible 
using luminescence study. This method is essentially a measurement of the energy of the electronic wave functions 
localized in the adsorbate. and is more sensitive to the thickness h of the adsorbate than to its lateral size R. As it is 
applied to buried layers, the 2d-3d transition process is stopped, and a comparison of different steps of the adsorbate 
evolution is allowed. 



Rudra et al. [221] have deposited a thin InAs layer on InP (2 ML for example) and covered it by InP. When the 
InAs layer is immediately covered, they observe a single luminescence line, corresponding to a thin, two-dimensional 
quantum well. When the coverage is performed after a certain time, they observe several luminescence lines, which 
can be attributed to different thicknesses of the layer. Calculation of the energy of transition showed that each line 



corresponds to an entire number of ML, between 4 and 8 MLs [222]. When a longer growth interruption is p erformed 



before coverage by InP, the luminescence exhibits a wide line, characteristic of an array of InAs islands |221[. Similar 



results were obtained by Lambert et al. [223] 



An interesting study of the smallest possible three-dimensional clusters has been made for InAs/GaAs by Colocci 



et al. [224| using luminescence. When increasing the coverage, they obtained an increasing number (1, 2, 3...) of 
luminescence lines, which they attributed to an increasing number of layers of the clusters. 

This seems to contradict the theoretical statement that the cluster shape should jump abruptly from a two- 
dimensional state (1 layer + the wetting layer) to a three-dimensional shape with many layers. The reason of 
the unexpected observation of two- monolayer- thick clusters may be attributed to a lack of thermal equilibrium. The 
quasi-critical clusters o f Co locci et al. have a size of several thousands of atoms, in qualitative agreement with 



theoretical predictions |208] 



XI. FORMATION AND EVOLUTION OF A POPULATION OF CLUSTERS 

A. The three stages 

In the present chapter, cluster formation will be addressed, as well as the evolution of the resulting population of 



clusters. In contrast with chapter [X, the misfit is assumed to be rather large, so that three-dimensional clusters form 
directly on the wetting layer, while a smaller misfit allows several adsorbate layers to be deposited before the onset 
of the instability. The case of large misfit corresponds, for instance, to InAs/GaAs(001), where 8a/ a ~ 0.07. 

The evolution proceeds as follows, i) Formation of two-dimensional clusters on the wetting layer. Their number is 
determined by kinematic factors, mainly the diffusion constant D. At lower temperature, D is small, adatoms cannot 
diffuse very far, and there are many small islands. For a given coverage, the island size increases with temperature, 
ii) As more matter is deposited, clusters grow and become three-dimensional (the 2d-3d transition studied in the 
previous chapter), iii) As growth proceeds, the cluster size increases and the cluster distribution evolves. 



L A transition from huts to domes has also been observed with a weaker misfit of 0.008, in SiGe/Si(100) [220] 



For lower misfits, e.g. in the case of Ge on Si, the beginning of the growth will be different, but stage (iii) is fairly 
analogous. 



During stage (i), addressed in paragraph XI B, the effect of elasticity is presumably not very strong, because the 



elastic interactions between adatoms and very small clusters are generally weak with respect to chemical, short range 
interactions responsible for capillary effects. In contrast, stage (ii) is an elastic effect. In phase (iii), the importance 
of elasticity is not so clear. 



As will be seen in paragraph XI E, the size d istributio n of three-dimensional clusters is generally narrow. This fact, 
which is n ot ob served for incoherent clusters [ 159| , 225|] is often attributed to elasticity. Actually, as will be seen in 
paragraph XI B , a population of two-dimensional, coherent clusters at equilibrium, would have a narrow distribution, 
but the case of three-dimensional clusters is controversial. 



The next three paragraphs XI B , XI C , XI D address simple, well understood and commonly accepted theoretical 
concepts which, unfortunately, do not give the complete solution of the essential problem, which is the ev olutio n of 
a set of three-dimensional clusters. The experimental data about this problem are treated in paragraph XI E and 
compared with theoretical views in paragraph XI F . 



B. When a monolayer starts growing 



As su mmarized in App . M, the growth of an uncompleted monolayer (or 'submonolayer') is rather well under- 
stood |4 , 17£ . 18C , 226 228 1 when elastic effects are absent, and indeed, as argued in paragraph XI A , elasticity is 
presumably not essent ial at low co verage. 

The usual theory jl 17E18C.226] deals with the growth of an element. Atoms of a single species are deposited at 
the rate of Fp monolayers per unit time (Fig. B3) on an initially planar, high symmetry surface (e.g. (001)). As seen 
in chapter III, application to binary semiconductors is possible although some caution is necessary. 

The best physical insight is provided by rate equations |179 18C.226] which describe the evolution of the number 
of atoms and islands. In the simplest approximation (acceptable at low temperature) only the adatom density p(t) 
and the cluster density p > (t) are considered, and very simplified forms of the rate equations are, omitting numerical 
factors: 



and 



f = -Fo - 2ZV - D W> 



t = °S 



(201) 



(202) 



The various terms correspond to deposition from the beam, formation of a pair and sticking of a n adatom on a 



cluster. Since local effects are ignored, numerical check by Monte-Carlo simulations is necessary [228 



The rate equations express the fact that freshly deposited atoms have two possibilities to form chemical bonds. 
They can either go to already formed terraces or nucleate new terraces. The former process requires a long diffusion 
on the surface, which is more likely to be possible (for a given coverage) if the diffusion constant D is high and if the 
growth rate Fq is slow.Q Therefore, the number of terraces per unit area after deposition of, say, 10% of a monolayer, 
is an increasing function of Fo and a decreasing function of D. More precise formulae are given in App. pi In practice, 
Fq is rarely much larger or much smaller than 0.1 monolayer per second (within a f actor 10) but D is very sensitive 
to temperature and can also be influenced by additives or 'surfactants' (see chapter XIII ). 

If elastic effects and other causes of instability are not too important, terraces begin to coalesce when the coverage 
becomes close to 1/2, and a complete layer forms when the coverage is equal to 1. This happens for the first ('wetting') 
layer in the case of InAs/InP and InAs/GaAs. The formation of the next layers, in coherent epitaxy, is only possible 
if the misfit is very weak. 



32 Of course, nucleation dominates at the very beginning of the growth process, 
coverage is an appreciable fraction of a monolayer 



while it becomes very unlikely when the 



C. Elasticity and the 2d-3d transition 



While elasticity can be neglected at the very beginning of the grow th, it sho uld be taken into account when clusters 
are big enough to become three-dimensional, and the rate equation s 201 and 202 must be generalized to incorporate 
the 2d-3d transition. This has been done by Dobbs et al. [ 165, 229 1. They write a set of three equations for 3 time- 
dependent (but space-independent) quantities which are the numbers of adatoms, of two-dimensional clusters and of 
three-dimensional clusters. Since these quantities are space- independent, the rate equations are global and cannot 
describe the effect of elastic forces on the current of matter. Elasticity is implicitly contained in the theory since it is 
the cause of the 2d-3d transition. Moreover, elasticity determines the detachment rate of atoms from stepsQ 

This detachment rate is especially important for two-dimensional clusters, because "atoms detaching from the edge 
of a three-dimensional island are more likely to find the energetically favourable sites in the higher levels than to 
completely detach from the island" [229|. In the case of two-dimensional islands, the detachment rate depends on the 
island size and this effect favours a particular island size as will be seen below. 

Even though we are mainly concerned with growth, it is of interest to investigate the equilibrium distribution of a 
cluster population. A good reason for this interest is that it is often convenient to anneal a deposit after it has been 
grown. 

The equilibrium cluster size Rq at a given coverage can be determined by minimizing the total free energy, or 
alternatively the free energy per unit adsorbate volume, which is obtained by dividing the free energy per cluster 
(199) by the cluster volume V — cR 2 . Thus, one has to minimize 
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with respect to R. The result is a finite value, namely 
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We conclude that the equilibrium state of a population of platelets corresponds to nearly identical platelets with a 
finite size close to Rq, with rather weak thermal fluctuations. 

The above argument neglects interactions between platelets, but is probably qualitatively correct even if the coverage 
is not very low. 

Thus, if a population of epitaxial, coherent platelets is annealed, their distribution should tend toward the equilib- 
rium distribution and therefore, toward a narrow size distrib ution ... provided they do not become three-dimensional, 
as they are expected to do. However, Priester and Lannoo [ 108 1 have pointed out that the existence of an optimal 
size Rq for two-dimensional clusters can contribute to the mechanism responsible for the narrow size distribution of 
three-dimensional epitaxial, coherent clusters. The kinetics associated to this mechanism have been studied by Dobbs 
ct al. [165, 229| . The time evolution of the size and number of three-dimensional clusters is well reproduced by Dobbs 
et al. but they cannot say anything about the size distribution since it is not included in the theory. 



D. Annealing of a population of incoherent clusters in the absence of growth: the Lifshitz-Slyozov mechanism 



Before investigating more in detail the evolution of a population of clusters on a growing surface, it is of interest 
to examine this problem in the simpler case of a surface which does not grow (and does not evaporate either). In the 
absence of elastic effects, when on ly the surface energy has to be taken into account, the problem has been solved by 
Chakraverty & Avignon [128-13C]. The evolution tends to reduce the surface energy, and therefore the total surface 
of the clusters. Since the adsorbate volume is constant, the ratio Surface/Volume decreases, and therefore the average 
cluster size increases. The microscopic mechanism is the following. Adatoms escape from smaller clusters and diffuse 
on the surface to bigger clusters where the free energy per atom (i.e. the chemical potential) is lower on smaller 



33 The concept of detachment rate, in the frame of global rate equations, is ambiguous. For instance, if an atom detaches 
fro m a cluster and attaches immediately after that to the same cluster, should this event be called 'detachment'? Dobbs et 



al. [22£] do not, since, in their formula (23), they introduce a barrier to detachment from 2-D clusters, defined as the elastic 
energy necessary to detach an atom and to bring it to an infinite distance. 



clusters (Fig_ 
solid alloys 



This mechanism, called 'Ostwald ripening' is rather universal and was first discovered in demixing 



230| , P31| . 

The theory predicts t hat the a verage cluster size R(t) is proportional to R(t ) ~ t 1 / 4 for three-dimensional, incoher- 
ently adsorbed clusters 128- 130] and R(t) ~ t 1 / 3 in three-dimensional alloys [ [230[|23l| . The effect is also present for 
two-dimensional adsorbed clusters ]232| in the absence of elasticity, and then R(t) ~ t / 3 . 

The expected size distribution is broad. 



Experiments on incoherent clusters [ 158, 225 1 are in qualitative agreement with theory, although the exponents do 
not always fit very well. The behaviour R(t) ~ t n is presumably only valid at very large times | p8fl . 

How are the above conclusions modified by elasticity? In particular, does the average size increase with time? A 
negative answer might reasonably be expected since the elastic interaction between adatoms is repulsive and long- 
ranged. These features explain the fact that two-dimensional, coherent, epitaxial clusters have an optimal size. In 
the case of three-dimensional clusters, the answer is not so clear, but qualitative considerations (App. E) suggest that 
the total elastic energy of a population of three-dimensional clusters is a decreasing function of the average size and 
therefore favours an increase of this size. In other words, atoms are still expected to detach preferably from small 
clusters and to diffuse toward small ones. As seen in the next paragraph, this seems to be in contradiction with 
experiment. This discrepancy suggests that new concepts should be introduced, as will be seen in paragraph XI F. 



E. Experiments 



In this paragraph are reported some experimental results on the evolution and size distribution of coherent clusters 
where elasticity has a role to play. 

Many studies have been devo ted to Ge/Si(001), and only a small part of them will be reported here. 

According to Goldfarb et al. [|159| , qualitative agreement with the Lifshitz-Slyozov theo ry ha s been observed in this 
system at temperatures T >700 K. However, the size distribution, as said in paragraph XG, has two maxima. The 



maximum which corresponds to small pyramids disappears at sufficiently high coverage. Then, the micrographs and 
histograms published by Ross et al. [233 show a remarkably narrow size distribution. Moreover, there is no hint of 



any increase of the average size of the coherent 'domes' which correspond to the higher maximum. This seems hardly 
compatible with the Lifshitz-Slyozov theory. 



The tendency to ripening predicted in paragraph XI D (decreasing number of clusters, increasing size) has been 
observed in III-V compounds under certain growth conditions. When the cluster size re aches a certain value ('critical 
size'), dislocations appear in the cluster, and the situation described in paragraph XI D becomes effective. However, a 
precise comparison with theory is often difficult because of the coexistence of coherent and incoherent clusters , wh ere 
the latter are la rger than the critical size and the former are smaller. This has been reported for example [162] in 
InAs/InP(001) [162| and InAs/GaAs(001) 160 |. A general feature when misfit dislocations have formed is that the 
size distribution is wide, as expected from theory. However, it should be pointed out that this evol ution is possible 
only under certain favourable growth conditions, for instance a long annealing after deposition 162]. Actually, very 
often, the whole population of c lusters remains coherent, which probably implies that their size does not exceed the 
critical value. Gerard et al. [197| have observed in the InAs/GaAs system that after the initial nucleation of numerous 
small islands, the average cluster size tends to increase when an annealing is performed after the deposit. However, 
the size does no longer increase after a certain annealing time. 

These results suggest that there is a competition between two effects and that there is a barrier which opposes 
usual Ostwald ripening. The different experimental results reported above show a strong dependence on the growth 
conditions and on the lattice mismatch. This implies that the evolution towards cluster ripening, although it is 
energetically favourable, can be kinetically delayed. 

What happens when the clusters are still coherent (and strained) is rather complex. An interesting set of results 
concerns the evolution of the average size as a function of the quantity of strained material. W hen this quantity is 
large, the cluster density is large too, and the average size increases with the depo sited quantity [18E1 as it is usually 
expected. This is contrast to what has been observed in the InAs/GaAs |164 197] and InAs/InP 194] systems when 
the deposited quantity is very small, i.e. just enough to induce the 2d-3d transition. When the deposited amount 
increases, the cluster density increases too, but the average size decreases. 

A remarkable experimental feature is the relatively narrow size distribution of such population of coherent clusters. 
The size distribution is particularly narrow when the cluster density is high; This fact suggests a possible role of 



clastic interactions between islands 1 164 197] . According to Kobayashi et al. [164], a sufficiently dense population of 
adsorbed clusters evolves, through inter-cluster exchange of matter, towards a more uniform size distribution. This 
is exactly opposite to what is predicted for incoherent adsorption, as seen in paragraph XII-D. 



F. Open questions 

Various explanations have been proposed for the narrow cluster size distribution and for the absence of Ostwald 
ripening, at least in certain cases. Our feeling is that the situation is unclear, and we shall mainly formulate questions, 
which are subject to controversy at the moment, but will presumably be answered in the next future, since research 
in this field is particularly active. 

Question 1. What is the potential energy of an adatom outside clusters? Is there an energy barrier near the cluster 
surface? 



As seen in paragraph X_C, the strain produced by a cluster increases with decreasing distance. The elastic potential 



felt by an adatom should also increase in absolute value. In principle, although the elastic interaction betwee n tw o 



adatoms or between two large, coherent clusters is repulsive, the cluster-adatom interaction can be attractive [ 234 1 . 
Such an attraction would enhance Ostwald's ripening and therefor e wo uld not help to explain the experimental facts. 
Therefore, we will accept the general belief that an energy barrier |235| | exists near a cluster. It opposes attachment of 
adatoms to clusters but, according to the detailed balance principle, it should also oppose adatom detachment from 
islandsrj and therefore slow down the evolution during annealing. This suggests a second question. 

Question 2. Are there big d iffere nces between the evolution when the beam is on and when it is off? 



As reported in paragraph [XE, this difference is clear when the 2d-3d transition takes place during anneal- 
ing p9^j22l| , |236| |. 

In the case of three-dimensional clusters, one could imagine that there is no evolution during annealing without 
beam, and a rapid change of the size distribution during growth. This would mean that atom detachment from islands 
is negligible and that the evolution is fully due to freshly deposited adatoms. Actually, the evolution during annealing 



can be slower than under irradiation 197 ], but is yet present 



Question 3. How high can be an energy barrier to adatom attachment, and how does it depend on the cluster size? 

The barrier height is related to the local strain. In the limit of very large clusters, the strain should reach a finite 

limit. The barrier height is al so ex pected to have a limit W\,ar which is reached for some value Ru m of the cluster 



radius. According to Barabasi 235 ], this limit can be comparable to the chemical bonding energy, and thus the cluster 
size cannot grow beyond Ru m ■ This saturation is a non- linear effect which suggests a breakdown of linear elasticity 
theory ... 



Question 4. Is line ar elasticity theory applicable in the presence of coherent, epitaxial clusters? 



? 



In paragraph XD , the strain in the vicinity of a cluster has been evaluated by the Tersoff approximation and 
found to diverge with the cluster size. This may be an artifact of the approximation, but, in the present state of our 
knowledge, a divergence of the strain calculated by linear, continuous elasticity theory is not excluded. This would 
mean that this theory breaks down for big enough clusters, and a plausible criterion is that it breaks do wn when 



the elastic energy per bond is comparable to the chemical energy, in agreement with Barabasi's statement [ 235 1 . For 
reasonable cluster sizes, this breakdown takes place only for large misfits, since the logarithmic divergence of (195) is 
very weak. 

Question 5. What is the effect of the 2d-3d transition? 



The 2d-3d transition can favour a narrow distribution of clusters 198 . The following mechanisms are possible. 
First, platelets have a preferred size and therefore should have a rather uniform distribution before they transform. 
The distribution may remain rather uniform after transformation. Second, the barrier for incoming adatoms is 
somewhat higher for three-dimensional clusters than for platelets. Thus, platelets grow faster than three-dimensional 
clusters. Third, there is a purely geometric effect, when the biggest clusters are already three-dimensional while some 
smaller clusters are still flat, the incoming atoms can be expected to go preferably to the latter, just because they 
occupy more space on the surface. Thus, in the neighbourhood of the 2d-3d transition, small platelets grow faster 
than bigger, three-dimensional clusters. However, as seen in the previous paragraph, there is experimental evidence 
that equalization still operates a long time after the 2d-3d transition. 



So far as we understand, we disagree on this point with Barabasi [235 



Question 6. Can adatom detachment from big islands be made easier by elasticity, in comparison with smaller 
islands? 

This has been suggested by Kobayashi |164| on experimental grounds, and by various theorists 235 



237]. If it is 



so, this may produce a current of atoms from big to small islands and favour a finite cluster size. The difficulty is to 
explain this easy detachment. An energy barrier at the surface does not provide an explanation since it opposes both 
attachment of adatoms to clusters and detachment from clusters, as said above. The explanation can result from a 
failure of linear elasticity theory, since the elastic energy gain will be lower than predicted by linear elasticity, when 
the cluster size becomes larger than the threshold Ru m defined above. 

Other possible solutions have been proposed, which suggests a last question. 

Question 7. Is the picture given in chapter K] correct? 

The postulates of this description are the following. 

i) Microscopic details, e.g. the strain dependence of the diffusion constant, have been disregarded. 

ii) The only contributions to the energy are the elastic energy (the volume integral of a continuous energy density) 
and a strain-independent surface energy. The surface stress, which generates a strain-independent surface energy, is 
neglected. 

iii) There is no three-dimensional diffusion, no mixing of the substrate and adsorbate materials. 

iv) The clusters have their equilibrium shape and the equilibrium energy. 



The modulation of the diffusion constant has been addressed in various papers reviewed by Schroeder & Wolf [ 238 1 . 
In our opinion, such effects should not be essential for a qualitative understanding of phenomena. For instance, 
they can modify the speed of Ostwald's ripening or the critical thickness, but canno t exp lain the narrow cluster 
size di strib ution. The effect of surface stress has been already addressed in paragraph XF . According to Shchukin 
ct al. 1 203 1 it ca n stabilize an intermediate cluster sizerj As in the case of two-dimensional clusters addressed in 
paragraph XI C , this can explain a narrow size distribution around this size. A more complete investigation seems 



necessary to check whether this interpretation is compatible with the relatively large cluster sizes which are obse rved . 
Chemical mixing depends on the materials and on temperature. In the case of InAs/GaAs, Joyce et al. [15S ] 
have given good arguments in favour of a strong mixing, both in the wetting layer and within clusters. Indeed, 
they succeeded in measuring the total volume of clusters with a reasonable accuracy, and found that it is larger 
than the deposited volume. Mixing can strongly influence the evolution of t he cl usters [239|. In case of 3d cluster, 
the alloy segregation is probably enhanced by strain-driven atomic diffusion [ |240| ] . For instance, one might imagine 
that penetration of the substrate material into the weakly relaxed cluster core lowers the elastic energy, and that 
this penetration becomes more difficult as clusters grow, thus making the growth of big clusters more difficult, in 
agreement with experiments. As mentioned in chapter HI, chemical inhomogeneities can also result from segregation 
or demixing within the adsorbate. An example is provided by the clusters resulting from the deposit of GalnAs alloy 
on GaAs(lOO). Indeed, an enrichment in Indium on the top of clusters was experimentally evidenced by different 



methods [241,242] 



The fact that clusters need an appreciable time to reach equilibrium is test ified by experiments 1 219 243] which 
demonstrate, for example, the occurrence of metastable states. Jesson et al. [244, 245 1 have described mechanisms 
which slow down the establishment of equilibrium. They consider a pyramidal cluster with faceted sides, and argue that 
adding a new layer on a side requires to overcome an increasingly high activation barrier when the size increases. This 
implies that the sides of the clusters are faceted, with a roughening temperature higher than the growth temperature, 
a property which is not necessarily general, even though faceting at room temperature is rather well established (see 

Fig. m. 



XII. STACKING FAULTS AND TWINS 



In this short chapter we come back to kinetic instabilities, those which are present at low temperature but not at 
high temperatures. A particular type of kinetic instabilities was studied in great detail in chapters IV and |yj, namely 
those which arise from the Ehrlich-Schwoebel effect. The emphasis given to Ehrlich-Schwoebel instabilities in the 
present review is justified by the large amount of theoretical work which has cast a good physical insight on these 
phenomena. There are, however, other types of instabilities, and it might be misleading to omit them completely in 



35 This size is rather metastable than stable, since the most stable state is constituted by big bubbles (strictly speaking, a 
single bubble) according to the argument given at the beginning of chapter IX 



this review, since one might be tempted to attribute to the Ehrlich-Schwoebcl effect instabilities which are due to 
other causes. 

Another possible and perhaps more frequent source of growth instability is constituted by stacking faults. These 
defects can arise during the growth of a crystal of a material which has another crystal form with a comparable energy. 

A well known example is provided by the face centered cubic Bravais lattice, which is the most common crystal 
among metallic elements. It is well known that a different stacking of the (111) planes leads to another Bravais lattice 
which is the hexagonal close packed lattice. Each atom has the same number of neighbours in both lattices, the atoms 
form triangles in both cases, and the energy difference can be expected to be small. Similarly, random stacking faults 
which destroy the periodicity are expected to have a low energy and to form rather easily during growth in the (111) 
direction. A growth model in which atoms are represented by hard spheres should reproduce this feature. 

Stacking faults are also possible in semiconducting elements, such as Si or Ge, although the interactions are quite 
different. Indeed, atoms cannot be represented by hard spheres. The rigid unit is not the atom, but tetrahedra of 5 
atoms (a central one and its 4 neighbours). Nevertheless, as in the case of metals, stacking faults in the (111) direction 
have a fairly lo w ene rgy. Indeed they can be obtained by rotating the tetrahedra without distorting them. 



Gallas et al. [246] have observed growth instabilities resulting from stacking faults during the growth of Si(lll) 
in the (111) direction at low enough temperature (about 300°C) although a perfect crystal easily grows at higher 
temperature. These stacking faults observed in the growth of Si(lll) are certainly of kinetic origin since they do 



not appear at higher temperature. We have seen in chapter VIII that stacking faults of thermodynamical origin 



can also appear in heteroepitaxy. The example of Si is interesting because the Ehrlich-Schwoebel effect has also be 



suggested 46 to explain instabilities observed at low temperature is the growth of Si in the (001) direction. The 



controversy about existence or non-existence of an Ehrlich-Schwoebel effect is reported in paragraph |IV A . 

Twins (Fig. [4l]) are more severe crystal defects than stacking faults, in the sense that they do not break only the 
periodicity, but also the orientational long range order. They are directly responsible for the formation of polycrystals, 
as seen from Fig. [42]. As stacking faults, twins can be interpreted as resulting from dislocations, as seen from Fig. |i^. 

XIII. EFFECT OF ADDITIVES (SURFACTANTS) 

In soft matter physics, 'surfactants' are materials which lower the surface free energy. A typical example is soap in 
soap bubbles. 

Applied to crystal growth, the word 'surfactant' has taken a completely different meaning. It designates an additive 
which i) stays on the surface during growth, instead of being incorporated as impurities, and ii) modifies the kinetic 
and/or thermodynamic properties of the surface. Those surfactants which facilitate layer by layer growth with re spec t 



to three-dimensional growth in heteroepitaxy are of particular interest. Examples for Ge/Si are As, Sb, Bi, Te [247 



The coverage in these instances is of the order of 1/3 of a monolayer. Without any surfactant, as seen in paragraph 



IX D, it is possible to grow 3 Ge monolayers on Si(O Ol) at 700 K before forming three-dimensional clusters. With As, 
one can grow 50 layers and probably more | 248| , 249| . At such hig h thickn esses, the stress is relaxed, and this implies 



that the adsorbate is no longer coherent, although it is epitaxial [248, 249 1 



Factors which may explain the effect of surfactants in crystal growth are the following. 
• Reduction of the surface energy. It is the essential property of surfactants in soft matter physics. In the case of 



interest here, it may have the important effect of favou ring the floating of surfactants during growth [247| but is 



not especially appropriate for growth of good crystals [246]. Most of the current theories of surfactant-favoured 



growth invoke modifications of the kinetic behaviour rather thermodynamic effects. 
Modification of the surface diffusion coefficient 246 - [252 1. 



Slow surface diffusion makes the equilibrium state harder to reach and therefore should help to avoid thermo- 
dynamical instabilities in heteroepitaxy. A way to reduce surface diffusion is to reduce temperature, but this 
can favour kinetic instabilities. Surfactants can therefore be a good alternative method. Of course, diffusion 
should not be too low. Another caveat is that a low diffusion coefficient increases the adatom density, and this 
effect tends to accelerate the evolution, so that the overall result is not easy to evaluate quantitatively, except 
by Monte Carlo simulations. 

Voigtlaender et al. [ p47| have observed the effect of several surfactants during the growth in the (111) direction 
i) of pure Si(001) and ii) epitaxial Ge/Si(001). They argued that surfactants which favour layer by layer growth 
in heteroepitaxy are precisely those which reduce surface diffusion in the pure material. Conversely, 'surfactants' 
which accelerate diffusion (Ga, Sn, In, Pb) favour three-dimensional growth. 



The paper of Voigtlaender et al. [247] has the merit to propose a simple picture. It also suggests an explanation 
of the different effect of different surfactants: those which accelerate diffusion are elements which just provide 
the number of electrons necessary to passivate dangling bonds, while those which provide more electrons can 
also retain adatoms and therefore slow down diffusion. 

Even if not fully correct, a simple picture has al ways the merit to provide a starting point for further progress 



and discussion. A point which has been criticized [246,253] is the method used by Voigtlaender et al. to estimate 



the surface diffusion velocity. Their criterion was the distance between islands after deposit of a submonolayer 
(see App. |b|). In the simplest cases, a small surface diffusion favours a short distance between islands. However, 
the case of surfactant-covered Si(OOl) at high temperatures may be more complicated as suggested by first 



principle calculations and Monte Carlo simulations of Kandel & Kaxiras 1 253 1 



• Modification of the detachment rate from steps. This is the other way to slow down the evolution toward 



equilibrium. Kandel & Kaxiras [253] have performed Monte Carlo simulations where a barrier to detachment 
from steps was taken into account in a phenomenological way and obtained a good agreement with experiment. 
In spite of the controversy between Kandel & Kaxiras and Voigtlaender et al. , both points of view are to some 
extent complementary. The remarkably synthetic picture given by Voigtlaender et al. provides much insight, 



but it involves oversimplifications which are not always confirmed by first-principle calculations [254] 



• Modification of the Ehrlich-Schwoebel effect. For example, Markov |255| has suggested that a surfactant can 
induce an inverse Ehrlich-Schwoebel effect, which favours downward motion of the atoms. Roughly speaking, 
the reason would be that the atoms willing to stick to an upper terrace (fc_ process of Fig. m\) are prevented 
to do so because the surfactant is already there, while an atom coming from above (k + process) can easily push 
the surfactant. 



• Segregation of the surfactant on the lower terraces [p56| . The result would be that the adatom mobility would 
be different on the lower and upper terraces. 

This enumeration shows that the action of surfactants is complex and controversial. 



Note that, when surfactants do not prevent cluster formation, they can affect their size and shape [257,258 



XIV. CONCLUSION 

This review is the common work of theorists and experimentalists, and has been an opportunity for many discussions 
between them. 

Such discussions are not always easy. Theorists tend to introduce oversimplified models, which help to understand 
some basic mechanisms, but may conceal other mechanisms which are not less basic. For instance, they have devoted 
more attention to phenomena which can be described by the simple S.O.S. model H than to twinning. 

This review is, in this sense, rather theoretical. For instance, the distinction between kinetic and thermodynamic 
instabilities is an oversimplification because both kinetics and thermodynamics may contribute to the same process. 
Such mixed effects have been taken into account in the present review, but more attention should be paid to this 
point in the future. 

Another source of misunderstanding between theorists and experimentalist, and technologists even more, is that 
when a difficulty appears, the theorist tries to understand it, while the technologist tries to avoid it, usually by using a 
more complicated method. For instance, instead of wondering how one can have a better self-organization of quantum 
dots, one can organize them artificially. Artificial organization of quantum dots is clearly outside the scope of the 
present review, but can be a better technology. Even then, it is of interest, both for fundamental and technological 
purposes, to understand the self-organization of quantum dots. This is still an open problem. 

Another point which is not completely understood is why the Asaro-Tiller-Grinfcld theory seems to be applicable 
to surfaces which are believed to be singular. 

In both cases, qualitative hints have been suggested in the present review, but a quantitative treatment is still to 
be done. It will presumably be numerical to a large extent, and we hope that the qualitative considerations presented 
here will help in the choice of the models to use in the simulations. 

The comparison between theory and experiment is probably easier in heteroepitaxy because most of the basic 
mechanisms are well identified. But the growth of a simple material can also be difficult. A typical example is 
diamond, of which it is impossible to produce big, artificial crystals. We would have liked to include this problem 
in this review, but it looks difficult to isolate simple concepts in a complicated chemical game, in spite of interesting 



theoretical attempts [25£[. The above considerations suggest directions for future research. We would recommend 



-A deeper fundamental understanding of basic mechanisms in experimental facts. 

-A more detailed theoretical description, to compare with a more systematic experimental study of the various 
factors which affect the various physical or chemical systems. 

This difficult program, is now being carried out by many research groups throughout the world, and a rapid progress 
can be expected. 
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APPENDIX A: LIST OF SYMBOLS 



a 

d 
D 

2d, 3d 
c 

E 
ES 

h 

J, J 
k B 
m, rh 

n 

(1 

R 

T 

T R 
v 
V 
x 



y 

z 



5a/a 
7 



r 

e 



fi 
v 

P 
a 
a 

n 

A 

T 
I 



atomic distance, or a related length 

distance, or space dimension 

surface diffusion coefficient of adatoms 

jargon for two-dimensional and three-dimensional 

monolayer thickness 

Young modulus 

Ehrlich-Schwoebel (effect, length, ...) 

Flux of atoms in MBE 

typical linear size of bumps, mounds, clusters, etc. perpendicular to the terrace direction ('height' 

one and two-dimensional current density (in atoms per second and per length unit) 

Boltzmann constant 

slope (has the dimension of a recipr ocal l ength in chapter IV , but is dimensionlcss in chapter [V|) 

coarsening exponent, see paragraph IV E 

a wave vector 

typical linear size of bumps, mounds, clusters, etc. in the terrace direction 

temperature 

roughening transition temperature (depends on the surface orientation) 

velocity 

volume 

a coordinate parallel to the terraces 

In chapter DO, quantity defined by ( |189[ ) 

concentration of an alloy 

a coordinate parallel to the terraces 

the coordinate perpendicular to terraces 

in chapters IV and [V], this coordinate divided by the monolayer thickness c) 



kinetic coefficient, ratio of the current density to the surface slope (see paragraph [VD I) 

coordinate x,y,z (e.g. in a summand) 

l/(k B T) 

lattice misfit (positive if the adsorbate is bigger than the substra te) 

step free energy per unit length, ste p stif fness (paragraph IV D 2) 

dimensionless exponent (paragraph |lVF ) 

coordinate x, y, z (e.g. in a summand) 

f = 0,37 



strain (chapters VIII and following) 

dimensionless distance from the instability th reshol d (chapters IV and |v|) 

reciprocal width of a domain w all ( paragraph IV F ) 

curvature of a step (paragraph VB ) 

chemical potential 

Poisson ratio (chapters VIII [X and |x|) 

number of adatoms per unit area 

surface stiffness, surface energy per unit area 

stress 

area per surface atom (chapter M) 

Elastic constants 

area of the growth front 

free energy 

distance between steps 

diffusion length 



APPENDIX B: THE DIFFUSION LENGTH AND THE NUCLEATION LENGTHS 



The diffusion length £jj is the typical distance travelled by an adatom on a flat, high-symmetry surface before being 
trapped. The requirement on the surface orientation ensures that the adatom is not trapped by a preexisting step, 



but by another adatom (nucleation process) or by the closed step of a growing island. The length Id can also be 
defined as the average distance between islands, in the stationary regime when the island density does not change (see 
Ref. p60|). 

The simplest model (a nucleus of two adatoms is stable and it does not diffuse) requires solely the knowledge of the 
diffusion constant D and the flux intensity Fq. Dimensional analysis suggests the expression (the lattice constant is 
put equal to one) Id = (D/Fq) . A very simple argument allows to determine the exponent <5. 

If P{£) is the probability of nucleation per unit time in a region of size £, in the stationary regime it is given by the 
number N of incoming adatoms times the probability p that an adatom encounters another one. N is simply given 
by Fg£ d (d — 1,2 being the dimension of the substrate) and p is the product of the number of distinct sites visited by 
the adatom (« £ d ) times the probability a site is occupied, i.e. the (average) density of adatoms. Finally, we have 

P{£) = F £ 2d (p) (BI) 

The diffusion length is given by the condition that the probability of nucleation in a region of size £r> during the 
deposition of one layer of material is of order unity: 

P(£ D )-1/F ^l =*. £l d ( P )~l (B2) 

Since there are no preexisting steps, adatom density is not influenced by ES barriers: (p) may be simply evaluated 
as the solution of the diffusion equation Fq + DW 2 p = in a region of size £jj, with p vanishing on the boundary: 
(p) w (F /D)£ D . The final result is: 



«) (B3) 

So 5 = 1/4 and S = 1/6, respectively in I+f and 2+1 dimensions. The hypothesis that a dimcr is stable and does 
not diffuse is valid only at low temperature s: when the size of the largest cluster which is mobile and 'unstable' is 



larger than one, the exponent 5 is modified [26C2GI ] 



In the main text the 'region of size P must be meant as a terrace of size £ which therefore can be a top, a bottom, 
or a vicinal terrace (see Fig. |l0| ). This distinction is necessary in the presence of step-edge barriers, because the 
average density (p) depends on the type of the terrace. In this context it is preferable to speak of 'nucleation lengths' 
for a top (£„), a bottom (£„), and a vicinal (•£„) terrace, rather than of diffusion lengths. Since ES barriers arc 
irrelevant for a bottom terrace, we simply have £^ = £d- Conversely, for top and vicinal terraces (p) is obviously 
an increasing function of £es- For a vicinal one it has a very weak dependence on £es and goes to a finite value for 
£es — > °o; i n contrast, (p) diverges for a top terrace. In 1+1 dimensions the nucleation lengths are found via the 
implicit equations ]5q , |66[ : 

{£ T n f + 6£es(0 3 -£ 4 d=0 1+1 dim (B4) 

(O 5 + U es {O a - £ D {£ v n + £es) = (B5) 

whose solutions are plotted in Fig. [44|. 

In 2+1 dimensions we must assume that terraces have circular edges, in order to obtain an analytical expression for 
the adatom density. In fact, if p depends solely on the radius r the diffusion equation reads p"(r) + (l/r)p'(r) — —Fq/D, 
whose general solution is p(r) = po + Pi In > — {Fo/^D)r 2 . For bottom and top terraces, analyticity in r = requires 
pi = 0. The diffusion length is found to be £d — (SD/Fq) 1 ^ 6 , while the nucleation length for a top terrace is 
determined by the equation: 

{£ T n f + U ES {£ T nf ~£%=0 2+1 dim (B6) 

whose limiting solution for £es/£d ~^ 00 differs from the one dimensional case. In units of £d, £„ — (1/6^.es) m 
1+1 dimensions and (^ ~ (1/Mes) 1 ^ 5 in 2+1 dimensions. 

The case of a vicinal terrace is not so easy: in fact it is characterized by two different quantities, the inner (ri) 
and outer (r^) radii. If £ = r-i — r\, it is no more true that the number N of incoming atoms per unit time is simply 
N « Fq£ 2 . The average adatom density itself is a function of both r\ and 7-2, but in the limit t\ ^> £ it depends solely 
on £: (p) w (Fq/2D)£ 2 . In the opposite limit (7*1 -C £) the attention must be paid to the logarithmic term, which 
diverges for 7*1 = 0. This limit is relevant for the vicinal terrace neighbouring a top terrace, when £es ~ * 00 (because 
in this limit £„ — ► 0). 



APPENDIX C: QUALITATIVE CONSIDERATIONS ON THE COARSENING EXPONENT 

Let us develop here some qualitative or semiquantitative considerations on the coarsening exponent n. Krug |9J] 
and Tang et al. p3| ] lay stress on noise, with the following argument: the 'coarsening time' is set by the condition 
that the fluctuations of the height of a mound of size L and average height h, induced b y sho t noise, are of the same 
order of the height itself. In chapter IV B we wrote that, because of shot noise: Ah ~ \JtjL d . Now we use the extra 



condition that Ah ~ h ~ m*L, to* being the largest slope in the profile. As a general rule: m* ~ L*, where ip — 
for a current with zeros at some finite too and ip = 1 by following the Burton-Cabrera-Frank model [it is sufficient to 
insert the current ffijj ) with 7=1 into Eq. (pTJ)]. Therefore we will have: 

L^-y/W =* L(t)~r with n= d + ^ + l) (CI) 

If the slope of the mounds is constant, ip = and we obtain n = l/(d + 2), i.e. n = 1/3 in 1+1 dimensions and 
n = 1/4 in 2+1 dimensions. In one dimension, the value n — 1/3 agrees with exact theories |85| , |86| ; in two dimensions, 
the value n = 1/4 seems to agree with the coarsening exponent found on (100) oriented substrates (quartic symmetry) 
or on isotropic substrates |B7 68 , EKU] , but there is no agreement if the substrate has a triangular symmetry ]8l| , |91| , in 
which case n = 1/3. 



A d iffer ent, nonrigorous approach to coarsening has also been developed by Golubovic |262j and by Rost and 



Krug 1 263 1 . We report it here (in a slightly modified way) because it may clarify some differences between 1+1 and 
2+1 dimensions, and —conversely to the previous argument— it neglects noise and stresses the deterministic character 
of coarsening. We will concentrate on the case of an asymptotic constant slope (tp — in the previous notation), with 
in-plane symmetry. 

The evolution of the surface is governed by the current: 

3 = 3es + 3m = am{\ - m 2 /m 2 ) + KX7 2 m (C2) 

As coarsening proceeds the two terms vanish separately, because \m\ — * too almost everywhere. So a first question 
is: Do they remain of the same order, or does one of the two become negligible? The answer is: They must keep of 
the same order, otherwise the surface would not have the displayed 'regular' profile and coarsening (a deterministic 
process) would not be possible. 

This condition implies a relation between the coarsening exponent n and the exponent d which governs the asymp- 
totic behaviour of to: (mo — |m|)/mo ~ t~^ . In fact a qualitative evaluation of the two terms give: 

\3es\~\3m\ =* *"*~^2 =► ,9 = 2n ( C3 ) 

Once established that \jes\ ~ \Jm\, two possibilities are left: \jes + Jm\ ~ \Jes\ ~ \Jm\ or \jes + 3m\ is much 
smaller than each single term in the current. To be more definite let us writ e the evolution equation for the interface 
thickness w 2 (t) = {z 2 (x,t)), starting from the evolution equation for z(x,t) p63| : 

\d t w 2 {t) = (to ■ j ES ) - K((V 2 z) 2 } (C4) 

We have already argued that the two terms on the right hand side are of the same order of magnitude. Now their 
difference may be i) of the same order too [and therefore dtw 2 (t) ~ (to • Jes)] or ii) it may be smaller. In the two 



cases we can write [263 



1 d t w 2 (t)<(rh-JEs) (C5) 



2 
In the same spirit as before, its qualitative evaluation gives: 

H&^ <t -* => 2n<l-tf =* n<\ (C6) 

t ~ ~ ~ 4 y ' 

where the equal sign corresponds to the case (i) mentioned just above. The inequality n < \ applies to 2+1 as well 
as 1+1 dimensions: since our model gives L(i) y lni in 1+1 dimensions (analytical result) and L(t) ~ t 1 / 4 in 2+1 
dimensions (numerical result), the conclusion flC6| ) on one side supports these results, and on the other side emphasizes 
an important difference between one and two dimensions, due to topology: in 1+1 dimension the current j which 
contributes to the evolution of the surface is vanishing small with respect to each term of the current itself. 



APPENDIX D: NONLINEAR ANALYSIS OF THE DYNAMICS OF A SINGLE STEP, IN THE 

PRESENCE OF STEP-EDGE BARRIERS 

We have to solve the differential equation: 

DV 2 p - p/r + F Q = (Dl) 

in the (x,y) plane, with y > C,{x,t) (one-sided model). Boundary conditions, if the step is locally in equilibrium, are: 



p(x,oo,t) = tF 

where k is the curvature: k = — Cxx/[1 + CS 3 ^ 2 - 

The dynamics of the step is determined by the following relation: 

V n = QDd n p\^ = QDft ■ Vp|^ 

where v n is the velocity normal to the step profile £(#,£): 

Vo+C 



v n = (0, v Q + C) • n 



v/i + Cl 

In the previous equations, vq is the velocity of the straight step and ft (the normal vector to the step) is: 



(D2) 
(D3) 



(D4) 



(D5) 



(D6) 



1. Dimensionless equation 

By introducing a new variable u = p — Fqt and by rescaling (x, y) with respect to x s and t with respect to r, 
Eqs. ( plp2| , p^ , p4| ) read: 

(D7) 
(D8) 

(D9) 



V 2 u-u = Q 



u{x,C,t) = -- - 



re, 



£ [i + C 2 ] 3/2 

u(x, oo, t) = 

— + c = Q(u y -( x u x ) 



where the quantities T = p\ ] t /x s and £ = T/tAF had already been defined in the body of the article. 

As explained in the main text, we introduce the small dimensionless parameter e = (1/2) — £ and the new variables: 



X = x^ft 
T = te 2 

H(X,T) = ((x 1 t)/e 



through which Eqs. (|D7|]d|,|d|) read 

eu x 
u(X,eH,T) = 



tuxx + uyy — u = 

r 



TH 



xx 



nr 



1/2 -e 
Now, u and H are expanded in powers of e 



1/2 - e [1 + e 3 H 2 x ] 3 / 2 

e 3 H T = n(u Y - e 2 H x u x ) 



u = u + eui + e 2 u 2 + e 3 u 3 + 
H = H + eH 1 + € 2 H 2 + e 3 H 3 



(D10) 
(Dll) 
(D12) 

(D13) 
(D14) 

(D15) 



(D16) 
(D17) 



2. Differential equation and boundary conditions at different orders 



Now the task is laborious, but straightforward: replace the expansions for u and H in Eqs. (D13,D14 D15|) , taking 
in mind that we must keep all the terms till order e 3 . For the differential equation it is easy, and we obtain: 



uqyy - u = 
uqxx + {uiyy - U\) = 

UlXX + (u 2 YY - Ui) = 
U 2 XX + (u 3Y Y ~ U 3 ) = 



For Eq. (D14), we start by rewriting the right-hand-side: 

u(X, eH, T) = -2r(l + 2e + 4e 2 + 8e 3 ) - Te 2 {H 0X x + eH 1X x) 
The left-hand-side must me handled with care: 

3 



i(X, eiJ, T) = Y^ e n Un{X, eH, T) 



n=0 



where: 



u n (X, eiJ, T) = u n + u nY eH + u nY Y 



e 2 H 2 



3^3 



U n YYY - 



(D18) 



(D19) 



(D20) 



(D21) 



and H n itself must be expanded, using Eq. ( D17 ). So, at third order in e, we obtain: 
u n {X, eH,T) =u n + u nY {eH + e 2 H x + e 3 H 2 ) + ^pL( e 2 H 2 + 2e 3 # #i) 



UnYYY ^3 u3 
a e -"0 



(D22) 



where, here and in the future, an expression of the form u n Y..Y must be evaluated in Y = 0. 
By coming back to the expression of u(X, eH, T) (see Eq. (D20|)), we have: 



Hi 



u(X, eH, T) = u Q + e(u aY H Q + u x ) + e (u 0Y Hi + u 0Y y-^- + ui Y H + u 2 



-e (uqyH 2 + uqyyHqHi + u yyy- 



G 



u 1Y Hi + u 1Y y~z- + u 2Y H + u 3 ) 



(D23) 
(D24) 



Comparing this expansion with the right-hand-side of Eq. ( D19 ), the boundary condition ( D14 ), at different orders 
in e writes: 



u = -2r 

u 0Y H + tii = -4T 



m 



uqyHi + uqyy-^- + uiyH + u 2 



-sr - h 



oxx 



r 



(D25) 



H 3 H 2 

u 0Y H 2 + uqyyHqHx + uqyyy—t- + uiyHi + uiyy-£- + u 2Y H + u 3 = ~16r - THixx 

o z 



Finally, we have to evaluate Eq. ( |D15| ) up to e 3 . This requi res to calculate uy at the order e 3 and ux at the order e. 
Concerning uy{X,eH,T), the result is identical to Eq. (D24), once that u n is everywhere replaced by u n y (and u n Y 
by u n YY, ■ ■ ■)■ Concerning ux(X, eH, T) at first order, since uq does not depend on X, the result is easily obtained: 



u x (X,eH,T) = eu lx (X,0,T) 



By using this expression, but not yet expanding uy, Eq. (D14) reads: 

2f(l + 2e + 4e 2 + 8e 3 ) + fi- 1 e 3 iJ 0T = u Y - e 3 H 0X u lx 



(D26) 



(D27) 



If uy is expanded according to the criterion given above, the velocity of the step (Eq. ( D15 )) is determined at the 
various order in e by the following relations: 



2r = uqy 

AT = uqyyH + uiy 



m 



8r — uqyyHi + uqyyy—^- + UiyyHq 



U2Y 



(D28) 



H 3 H 2 

16r + ft H t = u yyH 2 + «oyyy#o#i + u yyyy—^~ + uiyyHi + uiyyy—z- + u 2 yyH + u 3 y — uixHox 



The next and final step is to solve, order by order, Eqs. (D18) with boundary conditions (D25) and (D28). 
Zero order - We have to solve: uqzz — uo = 0, with uq = — 2r and uqz = 2r as boundary conditions. The solution is: 



u (Z) = -2Tcxp(-Z) 



(D29) 



We remark that the two boundary conditions are indeed equal, and this will be true also at the first and second order 
in e ! This is not surprising, because Hot —and therefore the motion of the step— enters at the third order of the 
boundary conditions (D28) only. 
First order - The solution of the equation u\zz — U\ = is: 



ui(X,Z,T) = A 1 (X,T)exp(-Z) 
and the boundary condition uqzHq + u\ = — 4r implies: 

2TH Q (X, T) + A x (X, T) = -AT 
Second order - The differential equation uixx + i u 2zz ~ W2) — gives the solution: 

u 2 (X,Z,T) = A 2 exp(-Z) + lXX Zexp(-Z) 



m 



where A 2 (as well Ai) is a function of X and T. The boundary condition uozHi+uozz-^-+uizHq+u 2 
gives: 

r A 2 = — 8 — 2Hi — H — 4:Hq — Hqxx 

Third order - The differential equation u 2 xx + (v-3zz — W3) = has the solution: 

A\xxxx 



u 3 (X,Z,T) = 



A, 



A 2 XX \ y AlXXXX y 2 



exp(— Z) 



(D30) 
(D31) 

(D32) 

-8T — H xx 

(D33) 

(D34) 



This time the two boundary conditions (D25) and (D28) are not equivalent and this impli es a condition on Hq 
under the form of a differential equation which governs the dynamics of the step. In fact, Eq. (D25) gives: 



T^Ag + 16 + H 1XX + 2H 2 + 8H 



H 3 



2Hl + 2HqH x +4H 1= 



(D35) 



and Eq. ( D28 ) gives: 



r-M, 



16 + H 1XX + 2# 2 + 8H 



m 



2H + 2HqHi 



+4^ + (nr^HoT + (S/A)H 0X xxx - H* x + 2H axx = 
Their comparison provides the equation for Hq we were looking for: 

(nry^THa = -2d 2 x H - (3/4)d x H + (dxH ) 2 



(D36) 
(D37) 



APPENDIX E: SCALING LAWS FOR COHERENT EPITAXIAL CLUSTERS 
1. Scaling of the elastic free energy of a cluster with respect to the misfit 



The first scaling law concerns the influence of the misfit for an adsorbate of fixed shape and volume and fixed elastic 
constants. The misfit can be represented by an external or force dipole density (or 'stress') acting inside the adsorbate. 
This stress will always be assumed weak enough, so that the response is linear, i.e. the strain at a given point is 
proportional to the misfit da/ 'a, so that the elastic energy density is proportional to (Sa/a) 2 . The total elastic energy 
is therefore also proportional to (Sa/a) 2 . The determination of the proportionality coefficient would be difficult. It is 
an integral which involves the elastic linear response functions (or 'Green's functions') of the system, which are very 
complicated and depend on the adsorbate shape and elastic constants. 



2. Scaling of the elastic free energy of a cluster with respect to its volume V 



The second scaling law concerns the size dependence of the elastic free energy for an adsorbate of a parti cular 
s hap e, whose linear dimensions can be varied but keep fixed ratios. The elastic free energy is the sum of (149) and 
(152) and can be written as 



5T 



(a) 



d 3 r$ (f, e xx (r),e xy (r), ...e zz (r)) 



(El) 



where the function $ is defined by ( pL49|) and (|152|). 

Expression (pHl) is very general, and general expressions are sometimes brain-twisting. It may therefore be a helpful 
exercise to consider the example of a cluster which is a half-sphere of radius R. If t he o rigin of the coordinates are 
chosen at the center of the sphere, the linear part of $ can be written, according to (|149|), as 



(j)('m) 



r\e 



, c raj ^xy ; 



.e zz ) — Const x 



5a , 



1 mi 



]0(z)O(R-r) 



(E2) 



where 9(z) — for z < and 6{z) — 1 for z > 0. The quadratic part <j)('? uad ) can easily be written using (152). 

Consider now (Fig. |49b) another structure of the adsorbate which is deduced from the first one (Fig. |45|a) by the 
similarity transformationPj 



r —>■ Xf 
The elastic free energy of the new structure has the form 



ST. 



W 



d 3 r$(r/X,e xx {r),e xy (r),...e zz (r)) 



(E3) 



(E4) 



where the function $ is just the same as before! The proof is easy and left to the reader, who can first treat the 
example of a half sphere. This property is true even if anharmonicity is taken into account. However, it would not 
be completely true if the atomic, discrete nature of matter were not disregarded. 
Now, if one makes the change of variables 



formula (E4) reads 



£ an {r) = n an {r/\) , r = Xp 



SF™ = X A J d A p<b{p, Vxx {p), Vxy {p),...n zz (p)) 



(E5) 



which is identical to (E2) apart from the factor A 3 and the different names of the variables. If (E2) is minimized 
by the strain e an (r) = e„ ,(f), then (E5) is minimized by rj an (p) = e° a {p), and therefore (K4) is minimized by 



^a,i(f) — £& 7 ( 7? /^)- And the minimum of (E4) or (E5) is equal to the minimum of (E2) multiplied by A 3 . Since the 
volume of the transformed cluster is equal to the volume of the original cluster multiplied by A 3 , the elastic energy of 
a coherent, epitaxial cluster of a given shape is proportional to its volume V. 



3. Scaling of the strain at long distance f of a cluster as a function of its volume V 



The problem is analogous to the previous one but now we consider the elastic displacement u(r), at a fixed, large 
distance r of the center of the cluster. In the special case of an isotropic elastic medium, the complete calculation has 
been done by Boussinesq [ 205 1 . 



The notation r designates the points of the totally constrained adsorbate, so that the actual locations of the atoms are 
r + u(r), where u is the elastic displacement 



The displacement at distance r turns out to be proportional to 1/r 2 , so that the strain is proportional to 1/r 3 . We 
shall argue here that this scaling is also true for an anisotropic elastic solid. As a lemma, we need to show first that, 
for fixed r, the displacement and the strain are proportional to V. 

Let us assume that the atomic displacements in a slab (S) just below the adsorbate are known. Its knowledge 



determines the displacement and strain in the remainder of the crystal through the equations of elasticity (154). 
Outside the slab (S), these equations are independent of the existence or non existence of the cluster. On the other 
hand, in the slab, the displacements Hi are the same as those which would arise from external forces /; acting on 
the n atoms inside the slab in the absence of the adsorbed cluster. The forces fi can in principle be determined by 
solving the equations of elasticity which usually yield the Uj's when the forces fi are known, but can be used also to 
determine the /j's if the Ui's are known, since there are 3n linear equations for 3n unknowns fi a . Since the total force 
must vanish, it may be preferable to introduce a force dipole density /^(r*) 

The displacements Ui = u(r) at distance r are related to the forces by linear equations of the form 

u a {f)= [ d 3 r' Y, T Q7? (r - 7)^(7) (E6) 



(S) 



7« 



where the response function T a7 ^(f— f") is independent of the size and shape of the cluster, since the cluster has been 
removed. It is also independent of the size and shape of the slab. Actually, T ai ^(f — f*) is a Green's function of an 
clastic solid limited by a plane, which can be found in textbooks... at least in the case of an isotropic solid or a cubic 
crystal limited by a (001) surface. 



We now make the similarity transformation (E3). It can easily be shown that the term linear with respect to the 
fictitious forces satisfies (pkf), so that the force or force dipole densities are the same in two corresponding points. 
Now, at a long distance, the dependence of r Q7 j(r — f") with respect to f" can be ignored and, if the cluster is centered 



at the origin 0, ( |Eq ) reads, before the similarity transformation, 

After the similarity transformation, if the transformed slab is called (S"), the displacement at the same point ris 
«L 2) (r) = £Wf) f d 3 r'^(r'/X) = A 3 ]Tr Q75 (r) / d 3 r"/^(f") 

where r" = r'/A. Comparison of the last two formulae shows that uS 2 '{r) = A 3 u' 1 '(r). The elastic displacement and 
stress at a long, given distance r of a coherent, epitaxial cluster of a given shape is proportional to its volume. 

4. Scaling of the strain at long distance r a cluster as a function of r 

A consequence of the previous paragraph is that the strain on the adsorbate surface at a long distance r from the 
center of a cluster decays as ?*~ 3 . Indeed, the strain should be proportional to the volume, so that the strain eo(r) 
produced by the clusters (C) and (C) satisfies ei(Ar) = A 3 eo(Ar). But we have seen that £i(Ar) = eo(r). Therefore 
eo(Ar) = A _3 eo(^*) in agreement with our statement. In particular, the strain produced by a single adsorbed atom at 
a distance r on the substrate surface is proportional to r~ 3 . 

5. Elastic interactions at long distance r between clusters or adatoms 

One can deduce how the elastic interaction between two clusters at a long distance r behaves. Indeed, the energy 
change when depositing an adatom can be shown to be a linear function of the local strain. Therefore, there is an 
elastic interaction between adsorbed atoms 

V ei (r)=Const/r 3 (E7) 

The constant can be shown to be positive, so that the elastic interaction is repulsive |3|,|[ . There is no contradiction 
with the above statement that the total elastic energy is negative. 



The el astic interaction between two adsorbed atoms at a long distance r is given by a formula analogous to (KT]), 
which is dl9C| ). 

The result ( jE7| ) can be used to determine the scaling of the interaction energy of a set of coherent clusters as a 
function of their average distance r or their average diameter R at constant coverage N/A. The distance r will be 



assumed to be much longer than the average cluster diameter R. According to (190), the elastic interaction must be 



proportional to 1/r multiplied by the square of the average number n of adatom per clusters (proportional to R ) 
and by the number of clusters J\f which is proportional to 1/r 2 . Since the total adsorbed mass is proportional to 
N = nJV and fixed, R 3 should be proportional to r 2 and the interaction energy between clusters is 

Pinter ~ Const x J\fn 2 /r 3 ~ Const x R 6 /r 5 ~ Const/r . 

The constant is certainly positive since it is positive for a single atom. Thus, the 'intercluster' elastic energy Jointer 
is a decreasing function of r and favours large clusters. In addition there is an 'intracluster' energy discussed earlier 
in this chapter. Since it is proportional to the cluster volume, i.e. to the number of atoms, the total intracluster 
energy is not affected by atom exchange, at least if the modification of the shape is ignored. A more detailed analysis 
taking shape modification into account does not modify the conclusion that the average cluster size should increase 
steadily with time, even if elasticity is taken into account. The possible reasons of the experimental facts which seem 



to contradict this conclusion are reviewed in paragraph XIII 
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Step free energy per unit length 7 
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Typical radius of thermal fluctuations (bumps and valleys) 
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Typical height Sh of thermal fluctuations 
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Excess free energy density of a height modulation 
of small amplitude Sh and wave vector q 
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Variation of the free energy density resulting from a rotation of 
the surface of small amplitude 89 around a high symmetry orientation 
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Continuous changes of surface height 
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TABLE I. Some typical properties of an infinite, plane crystal surface above and below Tr. The step free energy per unit 
length 7 is assumed to be isotropic, and therefore equal to the step stiffness. The infinite value of Sh above Tr is an intuitive 
consequence of the property R = 00, which in turn results from the formula R fa fcflT/7 since 7 = above Tr. c is the 
monolayer thickness. 
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TABLE II. Details on the table are given in the main text. When a box is empty, it means that the datum is not available. 
"The slope is given in degrees and/or through the orientation of the corresponding facet. The notation l°/2° means that 
the slope is in between one and two degrees. Authors study the distribution of the local slope and find that it is peaked at 
2° after 0.5 fj,m have been deposited, and at 3° after 1 /im. c The experimental finding of a slope corresponding to 2.4° is 
modified to 5.6°, because the former value is considered an 'average' one. d The values of L correspond to 2n/L* , L* being 'the 
diameter of the diffraction ring' in the SPA-LEED angular profiles. e In the case of GaAs, 1 ML means one ML of gallium and 
one ML of arsenic. * AFM = Atomic force microscopy; STM = Scanning tunneling microscopy; TEAS = Thermal energy atom 
beam scattering; LEED = Low energy electron diffraction; SPALEED = Spot profile analysis LEED; RHEED = Reflection high 
energy electron diffraction. 



FIG. 1. Cross-section of a ZnTe/CdTe multilayer seen by low resolution (a) and high resolution (b) electron microscopy. 
Narrow white stripes in (a) are two monolayer thick ZnTe inclusions. Since the lattice parameter of both components is quite 
different, phase separation into ZnTe and CdTe would presumably lower the free energy. Thus, the system is in principle 
unstable or metastable. Courtesy of N. Magnea and J.L. Pautrat, C.E.A. Grenoble. 

FIG. 2. Four mechanisms which can produce a growth instability, (a) Diffusion instability, (b) Kinetic instability due to 
step-edge barriers, (c) Thermodynamic instability due to different lattice constants, a and a + 5a, of the substrate and the 
adsorbate. In the present case 8a > 0. (d) Geometric, shadowing instability. 

FIG. 3. (a) Principle of a diffusion instability. If a small bump tends to form on a solid nucleus growing in a supersaturated 
solution, diffusing atoms (circles) tend to go to the bump which thus grows more rapidly than the rest of the nucleus. Matter 
diffusion on the solid (dashed arrows) has a stabilizing effect, i.e. tends to destroy the bump, (b) A snowflake. (c) The 
Mullins-Sekerka instability in directional solidification. The initially plane solid-liquid interface takes a wavy shape which can 
reach a stable shape if the velocity v of the interface is low enough. 

FIG. 4. A two-dimensional DLA pattern from the computer (a) and its experimental realization by electrodeposition of Zn 
in the electrolysis of a quasi-two-dimensional solution of ZnS04 (b). Courtesy of S. Bodea and P. Molho, Laboratoire Louis 
Neel, CNRS, Grenoble. 

FIG. 5. Schematic description of MBE growth in the case of a III-V semiconductor, e.g., GaAs. Each cube represents a cubic 
unit cell. Note that a large proportion of As atoms evaporate, while Ga atoms do not. 

FIG. 6. Energy gap and lattice constant for a few III-V compound semiconductors. 

FIG. 7. Transmission electron microscopy image of Alo.4sIno.52As lattice matched grown on InP(OOl) by MBE (V /III BEP 
ratio equal 20, growth rate equal to 1 fim/h and growth temperature equal to 450° C ) revealing a fine quasi-periodic contrast 
due to the presence of In-rich and Al-rich clusters : (a) plane view, (b) cross section . Courtesy of O. Marty, Univ. Lyon I. 

FIG. 8. TEM image of Gao.47Ino.53As grown by MOCVD showing CuPt ordering along (III) and (11-1). Courtesy of O. 
Marty and C. Pautet, Univ. Lyon I. 

FIG. 9. Two examples of carrier confinement (a) in a Gao.43Ino.57As quantum well between Alo.4sIno.52As barriers; (b) in a 
heterojunction between highly n-type AlInAs and lightly n-type doped GaAs. 

FIG. 10. Surface profile with top, bottom and vicinal terraces. D is the diffusion constant and k± are the sticking coefficients 
from above and below (see the main text). 

FIG. 11. The adatom uses its condensation heat (a) to look for a high-coordination site or (b) to knock-out and replace 
an edge atom. Except for simple cubic structures, the solid-on-solid constraint implies (c) that the capture area of a higher 
coordination site is larger. Dotted circles are newly incorporated atoms, directly from the flux (a,c) or after a knock-out 
process (b). 

FIG. 12. Profiles for the different potentials Vim). Full line: model I, Eq. (E3). Short-dashed: model II with 7 = 1, Eq. @. 
Long-dashed: model II with 7 = 2, Eq. (Uq). The behaviour at small m is the same in all the cases: V(m) ~ am 2 /2. 



FIG. 13. (a) Numerical solutions of the four-state clock model ( |6q ) and the surface growth model with (b) four-fold in-plane 
symmetry (p7j) and with (c) in-plane invariance ( pq ) . Figures (a) and (b) are taken from Ref. |6q] and are courtesy of Martin 
Siegert. White, light and dark grey, and black domains correspond respectively to the orientations (1,1), (1,-1), (-1,1) and 
(-1,-1). Figure (c) is unpublished and is courtesy of Martin Rost. 

FIG. 14. Example of a mound structure obtained with the Zeno model. 

FIG. 15. Surface morphology in kinetic Monte Carlo simulations of a vicinal surface in presence of ES barriers. The steps 
become wavy (a) due to the Bales and Zangwill instability |54[. Subsequently, the ripples break down starting at th e defects 



of the ripple pattern, and three-dimensional features appear on the surface (b). The figure is taken from Ref. [117| and it is 
courtesy of Pavel Smilauer. 

FIG. 16. Columnar growth. 

FIG. 17. Droplets of the Volmer- Weber type (above) and of the Stranski-Krastanov type (below). The shape, and particularly 
the angle 8, is independent of the size. 

FIG. 18. [110]au zone axis Fourier Filtered HREM image showing a misfit dislocation at an Au/AuNi(001) interface with a 
Burgers vector (1/2) < 101 >. The arrow shows only the projection of the Burgers vector in the (110) observation plane. The 
dashed line is the Burgers circuit around the dislocation core. The excess plane is also shown in black dashed line. Courtesy 
of C. Dressier and J. Thibault, DRFMC/SP2M, CEA-Grenoble. 

FIG. 19. Schematic cross-sectional view of a Shockley partial dislocation of Burgers vector (1/6) [112] which has glided in 
a (111) plane from the surface to the interface. A stacking fault appears in the glide plane between the dislocation and the 
surface, so that lattice planes have a discontinuity when crossing the glide plane. 

FIG. 20. Schematic cross-sectional view of a dislocation of Burgers vector (1/2)[110] which has climbed from the surface to 
the interface with the introduction of two supplementary planes (atoms in grey). The arrows represent the displacement field 
in the epilayer. 

FIG. 21. Schematic cross-sectional view of a perfect dislocation (1/2) [101] which has glided in a (111) plane from the surface 
to the interface. The arrows represent the displacement field in the epilayer. There is no lattice discontinuity at the glide plane. 

FIG. 22. A dislocation of Burgers vector b moving in its glide plane. 

FIG. 23. Critical thickness versus misfit for the glide system of Burgers vector (1/2) [101] and glide plane (111) in the FCC 
lattice of lattice parameter ao for two values of the core parameter a (Dashed curve: a— 2, continuous curve: a=4). 

FIG. 24. A dislocation loop nucleating at the surface. An atomic step appears at the intersection of the glide plane with the 
surface. 

FIG. 25. Total energy of a half-loop vs radius for 90° partial and 60° perfect dislocation in Au8oNi2o/Au(001). 

FIG. 26. Schematic representation of 2 arrays of orthogonal alternating dislocations. 



FIG. 27. Curves showing the total energy density of dislocations versus the residual elastic strain in the epilayer (lower axis) 
or versus the dislocation density (upper axis). 

FIG. 28. Blocking mechanism for the relaxation of MgO grown on Fe(OOl). Circles: experimental data from RHEED 
measurements. Continuous lines : equilibrium and blocking models. 

FIG. 29. Blocking mechanism : the progression of a threading dislocation T>\ may be disturbed by the stress field of a 
perpendicular dislocation 2?a lying at the interface, so that it bypasses this obstacle at a depth h* . 

FIG. 30. Residual strain needed to overcome the line tension (eu ne ), the orthogonal dislocation stress field (en) and both 
retaining forces (e to t) as functions of the channel width h* . The value e mirl corresponds to the possible residual strain before 
blocking of further moving dislocations. 

FIG. 31. Two ways of relaxing the stress in an adsorbate. (a) Creation of a misfit dislocation D by a microscopic motion 
of atoms in a 'glide plane' inside the material, (b) Creation of waves at the surface by atom diffusion on the surface on long 
distances. 

FIG. 32. Experimen tal i mage of a modulation arising from the misfit in a Gai-^In^As multilayer on an InP(OOl) substrate 



x ~ 0.28), from Ref. [154|. The dark layers are under tensile stress (5a/ a ~ —1.7%). The bright layers are lattice matched 



to InP and are used as growth markers. Although the quantitative theo ry ca n be different in a multilayer, the mechanism is 
probably the same as described for a single adsorbed layer in paragraph 



KB 



FIG. 33. Experimental images of clusters of InAs on an InP(OOl) substrate: (a) plane view and (b) cross-sectional observa- 
tions. The adsorbate (InAs) has a lattice constant 3.2 % greater than the substrate (InP). Small clusters are dislocation-free 
while bigger ones are not, as most clearly seen from the lower picture 

FIG. 34. Schematic representation of clusters of increasing size which arise from elasticity in coherent, epitaxial growth. The 
shape and the contact angle depend on the size. Small clusters are one monolayer thick while bigger ones are 'three-dimensional' 
with an increasing contact angle when the size increases. The spherical shape assumed for three-dimensional clusters is not 
realistic. In practice, almost detached clusters such as the right hand one have never been observed in the dislocation-free state. 

FIG. 35. Sinusoidal modulation of a surface (a) under the influence of an anisotropic, mechanical stress and (b) under the 
influence of an adsorbate. 



FIG. 36. A slab and a truncated pyramidal cluster. 
FIG. 37. A Ge cluster arising from the deposition of Ge on Si(001). STM image kindly communicated by Max Lagally. 

FIG. 38. The function G(x). 
FIG. 39. Growth of a submonolayer. 

FIG. 40. The Lifshitz-Slyozov mechanism. Atoms leave the small clusters to go to the big ones where the free energy per 
atom (i.e. the chemical potential) is lower. 



FIG. 41. Microscopic twins in Auo.sNio.s layers grown epitaxially on Au(OOl). High resolution electron microscopy 
cross-section image, the beam direction is [110]. 

FIG. 42. AFM image of the surface of a FePd epilayer deposited on Pd(001). The steps oriented at about 45° with respect 
to the edges of the figure correspond to the emergence of the microscopic twins at the surface. 

FIG. 43. Schematic representation of a microscopic twin due to the gliding of several Shockley partial dislocations on 
successive (111) atomic planes. 

FIG. 44. Nucleation lengths (£ n ) for the different kinds of terraces in a 1+1 dimensional model, as a function of the ES 
length (£es)- All the lengths are expressed in units of the diffusion length £r>- Starting from the top, we have iff (which does 
not depend on £es), £n (which goes to th e co nstant value 1/V2) and £„ (which slowly goes to zero). The behaviour of £„ at 
large £es can be directly found from Eq. (B4): £„ ~ (I/Mes) 1 ■ The small exponent (1/3) explains the slow decrease of £^. 



FIG. 45. (a) and (b) represent two similar adsorbed clusters (C), (C). The strain in two corresponding points is identical, 
(c) Appropriate external forces within a thin slab (L) just below the surface produce the same elastic displacements and strains 
in the substrate as the cluster (a), (d) The slab (Z/) deduced from (L) by the similarity transformation produces the same 
effect as the cluster (C). On the other hand, the strain at long distance ris multiplied by A 3 . 
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